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(57) Abstract 

A eukaryotic cell transformed with a DNA construct comprising a coding sequence encoding a polypeptide having the ac- 
tivity of a mammalian phospholipid kinase or a mammalian protein kinase activated by a phospholipid or its metabolite and re- 
gulatory elements to allow transcription of the coding sequence in the said cell wherein the regulatory elements include a repressi- 
ve or inducible promoter and the expression of the said coding sequence is lethal or growth inhibitory to the cell. In a preferred 
embodiment the cells are Schizosaccharomyce pombe. The cells are used as the basis of an assay for compounds involved in cell 
growth regulation. Such compounds can be used to treat cancers and the formation of blood vessel plaques. 
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The present invention relates to assays for compounds involved in cell growth 
regulation, and more particularly to those involved in transducing signals from 
hormones, growth factors and oncogenes. Such compounds .represent potential 
drugs or targets for drugs to treat cancers and to prevent the formation of 
plaques which cause heart disease. 

* 

Phosphatidylinositol 3-OH kinase (Ptdlns 3-kinase) catalyses the 
phosphorylation of the 3-hydroxyl of inositol in Ptdlns, PtdIns-4-phosphate or 
in PtdIns-4,5-bisphosphate. This activity is involved in transducing signals 
from a number of hormones, growth factors and oncogenes. The standard 
assay for the activity of the Ptdlns 3-kinase involving lipid moieties does not 
readily lend itself to a screen for potential inhibitors (or activators) of catalytic 
function. Members of the protein kinase C family of enzymes are involved in 
transducing signals from a number of hormones, growth factors and oncogenes. 
The standard assay for protein kinase C does not lend itself to a screen for 
potential inhibitors (or activators) of catalytic function. 

Thus it has been desirable to investigate other means of searching for 
inhibitors. 

A first aspect of the invention provides a eukaryotic cell transformed with a 
DNA construct comprising a coding sequence encoding a polypeptide having 
the activity of a mammalian phospholipid kinase or a mammalian protein kinase 
activated by a phospholipid or its metabolite and regulatory elements to allow 
transcription of the coding sequence in the said cell wherein the regulatory 
elements include a repressible or inducible promoter and the expression of the 
said coding sequence is lethal or growth inhibitory to the cell. 
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Polypeptides having the activity of a phospholipid kinase or a protein kinase 
activated by a phospholipid or its metabolite are involved in cell growth 
regulation. 

5 By "growth inhibitory" we mean that the growth rate of cells transformed with 
the said DNA construct is at least two to three fold lower than the same cells 
not transformed with the said DNA construct when grown in the same culture 
conditions. 

10 By "repressible" we mean that in the presence of a repressing agent the 
expression from the promoter is at least two-fold lower than expression from 
the promoter in the absence of the repressing agent. 

It is preferred if expression from the promoter in the presence of a repressing 
15 agent is at least five-fold lower, more preferably ten-fold lower or even more 
preferably 100-fold lower than expression from the promoter in the absence of 
the repressing agent. 

By "inducible" we mean that in the presence of an inducing agent the 
20 expression from the promoter is at least two-fold higher than expression from 
the promoter in the absence of the inducing agent. 

It is preferred if expression from the promoter in the presence of an inducing 
agent is at least five-fold higher, more preferably ten-fold higher or even more 
25 preferably 100-fold higher than expression from the promoter in the absence of 
the inducing agent. 



30 



When an inducible promoter is used there is sufficiently low expression of the 
polypeptide in the uninduced state that the lethal or growth inhibitory phenotype 
is not observed whereas when the inducing agent is present the lethal or growth 
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inhibitory phenotype is observed. 

When a repressible promoter is used there is sufficiently low expression of the 
polypeptide in the repressed state that the lethal or growth inhibitory phenotype 
5 is not observed whereas when the repressing agent is absent the lethal or 
growth inhibitory phenotype is observed. 

Suitable eukaryotic cells include mammalian cells, such as COS cells and CHO 
cells, insect cells, slime mould such as Dictyostetium, and yeast 

10 

Suitable regulatable mammalian cell promoters include glucocorticoid-inducible 
promoters and the metallothionein promoter. 

It is preferred if the cell is a yeast cell. 

Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, Saccharomyces, Kluyveronxyces, Candida, 
Torulopsis, Hansenula, Schizosaccharomyces, Citeromyces , Pachysolen, 
Debaromyces, Metschunikowia, Rhodosporidium, Leucosporidium, Botryoascus, 
20 Sporidiobolus, Endomycopsis, and the like. 

It is preferred if the yeast is a fission yeast. 

It is further preferred if the yeast is Schizosaccharomyces. 

25 

Preferably, the said polypeptide has the activity of a phospholipid kinase, for 
example a catalytically effective portion of the said kinase. Phospholipid 
kinases include phosphatidyl inositol 3-kinase, phosphatidyl inositol 4-kinase 
and phosphatidyl inositol 5-kinase which phosphorylate the inositol ring on the 
30 3', 4' or 5' hydroxyl, respectively. 
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Suitably, the said polypeptide is a catalytically effective portion of a 
phosphatidylinositol 3-OH kinase. It is convenient to use the 110 kDa 
mammalian Ptdlns 3-kinase catalytic subunit. 



5 In further preference, the said polypeptide is a catalytically effective portion of 
a protein kinase C (PKC). Suitably, the protein kinase C is PKC-7 or PKC-6 
or PKC-i; or PKC-€. 



A constitutive promoter such as adh may be used (disclosed in ref 1). Also, 
10 the SV40 promoter may be used. 

Thus, a further aspect of the invention provides a Schizosaccharomyces cell 
transformed with a DNA construct comprising a coding sequence encoding a 
polypeptide having the activity of a mammalian phospholipid kinase or a 
15 mammalian protein kinase activated by a phospholipid or its metabolite and 
regulatory elements to allow transcription of the coding sequence in the said 
cell wherein the regulatory elements include a constitutive promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to the cell. 

20 Any gene that arrests growth or is lethal can be expressed only transiently for 
the purposes of subsequent inhibitor screening. In the case of a constitutive 
promoter in a plasmid carrying a marker, freshly transfected cells are diluted 
directly into medium using a combination of growth conditions to select for 
transfectants (for example, medium containing no leucine) and added potential 

25 inhibitors of the constitutive! y expressed mammalian gene to test for their 
efficacy. 

Mammalian genes whose expression can be controlled by growth conditions can 
be introduced into the yeast under conditions where expression is low (ie 
30 suppressed or not induced). 
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It is preferred if the mammalian genes so introduced are stably maintained in 
the yeast. 

It is further preferred if the mammalian genes are stably integrated into the 
5 yeast genome. 

Expression is then increased following growth under de-repressing conditions 
(for example removal of thiamine) and potential inhibitors scored on their 
ability to permit growth under these conditions. The use of an integrant and 

10 a controllable promoter provides the most amenable procedure. The induction 
of cell arrest or cell death provides a powerful screen for a suppressor of such 
events. The present invention provides a screen for suppressors of regulatory 
proteins that control other mammalian functions either directly, for example 
protein kinases, or indirectiy through the production of small regulatory 

15 molecules, for example an inositol lipid kinase. 

Thus, in a preferred embodiment, the 5. pombe cells contain a coding sequence 
for the 110 kDa mammalian Ptdlns 3-kinase catalytic subunit under the 
regulatory control of the nmt promoter and with other suitable regulatory 

20 elements, such as a transcription terminator, as is known in the art, for 
expression of the said catalytic subunit. In the presence of thiamine the 
promoter is inoperative and the cells carrying the Ptdlns 3-kinase catalytic 
subunit plasmid grow as the parental strain. (It will be appreciated by those 
skilled in the art that the parental strain may not be wild-type. For example 

25 mutant strains containing Ade" or Leu* or Ura" mutations may be used as the 
parental strain to allow selection of plasmid uptake). In the absence of 
thiamine the nmt promoter functions and the Ptdlns 3-kinase catalytic subunit 
is induced. This has been shown by demonstrating a substantial increase in 
Ptdlns 3-kinase activity under these conditions. However, following this 

30 induction the cells cease to divide; cultures plated in the absence of thiamine 
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do not grow but die. 

Derivative of the runt promoter that retain the thiamine-repressibility 
characteristics of the wild type promoter may also be used. 

5 

As an alternative to the thiamine-repressible runt promoter, the fbpl gene 
promoter from S. porhbe can be used. The fbpl gene promoter is repressed in 
the presence of 8% glucose as disclosed by Hoffman & Winston (1990) 
Genetics 124, 807-816 incorporated herein by reference. Thus, in a further 

10 embodiment, the S. pombe cells contain a coding sequence for the 110 kDa 
mammalian Ptdlns 3-kinase catalytic subunit under the regulatory control of the 
fbpl promoter and with other suitable regulatory elements for expression of the 
said catalytic subunit. In the presence of 8% glucose the function of the 
promoter is repressed and the cells carrying the Ptdlns 3-kinase catalytic 

15 subunit plasmid grow on the parental strains. In the absence of glucose the 
fbpl promoter functions and the Ptdlns 3-kinase catalytic subunit is induced. 

The lethal phenotype of the S. pombe expressing mammalian Ptdlns 3-kinase 
provides a very powerful tool with which to screen for inhibitors of this 

20 activity. Cells plated in the absence of thiamine will survive and proliferate if 
the activity of the Ptdlns 3-kinase is suppressed. A direct demonstration that 
this is indeed the case, is afforded by the finding that a mammalian Ptdlns 3- 
kinase regulatory subunit (p85a) when coexpressed with the Ptdlns 3-kinase 
catalytic subunit will rescue these cells and allow proliferation. Clearly, 

25 therefore, coexpression of (or generally the presence of) the p85or subunit 
should be avoided in the assay of this embodiment, as should, in other 
embodiments, other activity-suppressing compounds. 



In further embodiments the S. pombe cells contain a coding sequence for a 
mammalian protein kinase C under the regulatory control of the nmt promoter 
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or the fbpl promoter. 

As an inhibitor screening process, a further advantage afforded by this 
approach is that general cytostatic and cytotoxic compounds will score negative; 
the screen will distinguish the action of the mammalian Ptdlns 3-kinase or 
protein kinase C against the background of a plethora of essential eukaryotic 
gene functions. 

Thus, a further aspect of the invention provides an assay kit comprising a 
eukaryotic cell according to the first aspect of the invention and culture medium * 
such that the cell will divide and grow and such that the said coding sequence 
is expressed, the expressed polypeptide at least preventing cell division in the 
cell culture. 

15 Conveniently the kit comprises 5. pombe as the eukaryotic cell. 

* 

The invention also encompasses compounds identified as being useful in the 
assays of the invention. 

20 These compounds are useful in the treatment of disease and medical conditions 
where there is an undesirable function of a phospholipid kinase or a protein 
kinase activated by a phospholipid or its metabolite. 

Such diseases and conditions include cancer, inflammation, Alzheimer's 
25 disease, restenosis, atherosclerosis and wound healing. 

Suitable promoters and coding sequence can be incorporated into vectors in the 
correct orientation by methods known in the art, some of which are described 
in Sambrook et al (1989) Molecular Cbning, a practical approach (2nd 
30 Edition), Sambrook, J., Fritsch, E. & Maniatis, T., eds, Cold Spring Harbor 
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Laboratory Press, Cold Spring Harbor, New York, incorporated herein by 
reference. 



A variety of methods have been developed to operatively link DN A to vectors 
via complementary cohesive termini. For instance, complementary 
homopolymer tracts can be added to the DNA segment to be inserted to the 
vector DNA. The vector and DNA segment are then joined by hydrogen 
bonding between the complementary homopolymeric tails to form recombinant 
DNA molecules. 



Synthetic linkers containing one or more restriction sites provide an alternative 
method of joining the DNA segment to vectors. The DNA segment, generated 
by endonuclease restriction digestion as described earlier, is treated with 
bacteriophage T4 DNA polymerase or E. coli DNA polymerase I, enzymes that 
15 remove protruding, 3 '-single-stranded termini with their 3'-5'-exonucleolytic 
activities, and fill in recessed 3 '-ends with their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large molar 

20 excess of linker molecules in the presence of an enzyme that is able to catalyze 
the ligation of blunt-ended DNA molecules, such as bacteriophage T4 DNA 
ligase. Thus, the products of the reaction are DNA segments carrying 
polymeric linker sequences at their ends. These DNA segments are then 
cleaved with the appropriate restriction enzyme and ligated to an expression 

25 vector that has been cleaved with an enzyme that produces termini compatible 
with those of the DNA segment. 



Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
30 Biotechnologies Inc, New Haven, CN, USA. 
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A desirable way to modify the DNA encoding the polypeptide of the invention 
is to use the polymerase chain reaction as disclosed by Saiki et al (1988) 
Science 239, 487-491. 

In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated into the 
amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into expression 
vectors using methods known in the art. 

Transformation of appropriate cell hosts is accomplished by well known 
methods that typically depend on the type of vector used and host cell. 
Transformation of Saccharomyces and related cells is described in Sherman et 
al (1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
With regard to vertebrate cells, reagents useful in transfecting such cells, for 
example calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Inc., 
Gaithersburg, MD 20877, USA. 

Schizosaccharomyces pombe may be transformed following LiCl treatment or 
by electroporation. 

Conveniently, a Bio-Rad Pulse Controller may be used for electroporation of 
S. pombe cells. 

a) Grow up cells to OD 595 less than or equal to 0.5 in minimal medium. 

b) Centrifuge cells at 1500 g for 5 min, remove supernatant and resuspend 
in 20 ml ice-cold distilled water, centrifuge again, remove supernatant and 
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resuspend in 20 ml ice-cold 1 M sorbitol, centrifuge again and remove 
supernatant. 



c) Resuspend cells in ice-cold 1 M sorbitol to a density of -5x10 
5 cells/ml (concentrated 500 times when compared to original culture). 



d) Use 40-100 /il of cell suspension per transformation. Add DNA (up to 
100 ng) in 1 /J in TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) to cells and 
incubate on ice 5 min. 

10 

e) Transfer to pre-chilled cuvettes (0.2 cm gap) and apply pulse (1 .5 KV, 
25 fi¥, 200 Q). 

f) Immediately add 900 fi\ of ice-cold 1 M sorbitol and transfer to a 
15 chilled tube on ice. 



g) Promptly spread 100-200 p\ onto a selective minimal medium plate 
containing 1 M sorbitol and culture at 32°C until grown. 

20 The technique of electroporation of yeast is disclosed in Becker, D.M. and 
Guarente, L. (1990) Meth. EnzymoL 194, 182. 

Machines for electroporation are available from other manufacturers and can 
be used to transform yeast and mammalian cells according to their instructions. 

Successfully transformed cells, ie cells that contain a DNA construct, can be 
identified by well known techniques. For example, cells resulting from the 
introduction of an expression construct of the present invention can be grown 
to produce the polypeptide of the invention. Cells can be harvested and lysed 
30 and their DNA content examined for the presence of the DNA using a method 
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such as that described by Southern (1975) 7. Mol. Biol 98, 503 or Berent et 
al (1985) Biotech. 3, 208. 



In addition to directly assaying for the presence of recombinant DNA, 
5 successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression of 
the protein. For example, cells successfully transformed with an expression 
vector produce proteins displaying appropriate antigenicity. Samples, of cells 
suspected of being transformed are harvested and assayed for the protein using 
10 suitable antibodies, for example by western blotting. 

The invention will now be described in detail with reference to the following 
Examples and Figures wherein: 

15 Figure 1 shows the nucleotide sequence (SEQ ID No 1) and deduced amino 
acid (SEQ ID No 2) of the sequence 110 kDa catalytic subunit of Ptdlns 3- 
kinase (PI 10). 

Figure 2 shows the nucleotide sequence of the nmt promoter region (SEQ ID 



20 No 3). 



Figure 3 shows the nucleotide sequence of PKC-c (SEQ ID No 4). 



Figure 4 shows the nucleotide sequence of PKC-7 (SEQ ID No 5). 



25 



Figure 5 shows the nucleotide sequence of PKC-5 (SEQ ID No 6). 



Figure 6 shows the nucleotide sequence of PKC-17 (SEQ ID No 7). 



30 



Figure 7 shows that the lethal effect of pi 10 expression in 5. pombe is 
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suppressed by p85 expression. 

Figure 8 shows the isotype-specific effects of PKC expression in S. pombe. 

5 Figure 9 shows the effect of PKC expression on growth rates in liquid culture. 

Figure 10 shows that PKC-6-induced growth inhibition is the result of kinase 
activity. 

10 Example 1: Assay using catalytic subunit of Ptdlns 3-kinase and nmt 
promoter 

Isolation of Ptdlns 3-kinase catalytic subunit cDNA. The cDNA for the 1 10 
kDa catalytic subunit can be isolated by a conventional cloning strategy. 

15 Purification of the bovine enzyme from brain tissue (Morgan, Smith et al 1990) 
has demonstrated that sufficient protein can be isolated for protein sequence 
determination. This is unequivocally established for the 85 kDa regulatory 
subunit which has been sequenced from this source and, as a consequence, 
cloned (Otsu, Hiles et al 1991). The Ptdlns 3-kinase from bovine brain (85- 

20 110 dimer) is purified according to Morgan, Smith et al (1990) by sequential 
fractionation with ammonium sulphate and chromatography on DEAE-cellulose, 
phosphocellulose, Sephacryl S-200 and Mono Q. In order to remove 
contaminants and separate subunits, the protein is further purified by sodium 
dodecyl sulphate polyacrylamide gel electrophoresis according to Laemmli 

25 (1970), the 110 kDa protein visualised in ammonium chloride (4N), 
electroeluted and digested with trypsin as described in Katan, Kriz etal (1988). 
Tryptic peptides are then separated by standard procedures and subjected to 
amino acid sequence determination. Sequence established for the 110 kDa 
catalytic subunit is used to predict redundant oligonucleotide probes for 

30 screening a bovine brain cDN A library. Standard cloning procedures are then 
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employed in the isolation of a cDNA encoding the complete open reading frame 
of the 1 10 kDa subunit (Sambrook et al 1989). The sequence of the cDNA is 
determined by commonly employed dideoxy-sequencing procedures. A specific 
example of using this strategy is described by Hiles et al (1992) Cell 70, 419- 
5 429. 

Materials: Restriction enzymes and DNA modification enzymes were obtained 
from standard commercial sources and used according to the manufacturer's 
recommendations. Oligonucleotides were synthesised on an Applied 
10 Biosystems 380B DNA synthesiser and used directly in subsequent procedures; 



Protein Purification and Amino Acid Sequence Determination: The 

purification of the p85or and pi 10 proteins by chromatography on a peptide 
affinity column corresponding to amino acids 742-758 of the kinase insert 

15 region of the human PDGF-jS receptor has been described (Otsu et al (1991) 
Cell 65, 91-104). Proteins were released from the affinity matrix using SDS- 
containing buffers, separated on a Prosieve agarose gel, and visualised by 
staining with Coomassie blue G250. The band corresponding to pi 10 was 
excised and protein was eluted by tube gel HPEC. Protein was precipitated 

20 from pllO-containing fractions by treatment with trichloroacetic acid and then 
washed with acetone. The pi 10-containing pellet was resuspended and digested 
with lysylendopeptidase in the presence of SDS, and peptides were separated 
by tandem ion-exchange chromatography and reverse-phase HPLC. This 
procedure was carried out on three separate PI3-kinase preparations. A fourth 

25 preparation was eluted from the matrix as before and boiled for 5 min. After 
cooling, the sample was diluted with 25 mM Tris-HCl (pH 8.8) and digested 
directly with lysylendopeptidase for 72 hr at 30°C. Peptides were separated 
as above. Peptide sequences were determined using a modified Applied 
Biosystems 477A automated pulse-liquid sequencer. 

30 
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mRNA Isolation and cDNA Cloning: Total RNA was isolated from SGBAF-1 
cells by the method of Chirgwin et al (1979) Biochemistry 18, 294-299 and 
poly(A) mRNA was selected by chromatography on oligo-(dT)-cellulose 
(Maniatis et al (1982) Molecular Cloning: A laboratory manual, Cold Spring 
5 Harbor Press, Cold Spring Harbor, New York). An oligo-dT primed cDNA 

* 

library of 5 x 10 6 primary recombinants was constructed in lambda Uni-Zap 
(Stratagene) from 5 /ig of this mRNA using the Stratagene Uni-Zap cDNA 
cloning system. The construction of the total bovine brain cDNA library in 
lambda Uni-Zap has been described previously (Otsu et al (1991) Cell 65, 91- 
10 104). 



Library Screening and Hybridisations: The unamplified SGBAF-1 cDNA 
library (10 6 recombinants) was plated on E. coli K12 PLK-F (Stratagene) at a 
density of 10 s plaques per 15 cm dish, and lifts were taken in duplicate onto 

15 nitrocellulose membranes (Millipore) , For screening, filters were prehybridised 
for at least 1 hr at 42°C in 6 x SSPE, 0.5% SDS, 10 x Denhardt's solution, 
and 100 pg/ml denatured sonicated herring sperm DNA (Sigma). Hybridisation 
was carried out in the same solution containing 10 ng/ml radiolabeled 
oligonucleotide. Oligonucleotides used were: peptide N, (MDWIFHT; SEQ 

20 ID No 8) 5 # -AA(G/A)ATGGA(T/C)TGGAT(C/T/A)TT(T/C)CA(T/C)AC-3' 
(SEQ ID No 9); peptide J (DDGQLFHIDFGHF; SEQ ID No 10) 5'- 
GATGATGGCC-A(G/A)CTGTT(T/C)CA(T/C)AT(T/A)GA(T/C)- 
TTTGG CC A(T/C)TT (SEQ ID No 1 1). Oligonucleotides were labeled with 32 P 
at the 5' end in a 20 y\ reaction containing 100 ng of oligonucleotide, 1 x 

25 kinase buffer (Promega), 0. 1 mM spermidine, 5 mM dithiothreitol, 100 fcCi of 
[7- 32 P]ATP (5000 Ci/mmol, Amersham), and 2 y\ (20 U) of T4 polynucleotide 
kinase (Amersham). Filters were washed in 6 x SSC, 0.1% SDS at room 
temperature and then subjected to autoradiography using Kodak XAR film. 
Hybridising clones were plaque purified and rescued as plasmids according to 

30 the manufacturer's instructions. 
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Characterisation of cDNA Clones: Sequencing was carried out by the chain 
termination method using the Sequenase system (US Biochemicals). Clones for 
sequencing were obtained by directed cloning of restriction fragments into 
M13mpl8 and mpl9 vectors (Yanisch-Perron et al (1985) Gene 33, 103-119) 
5 and by making a series of exonuclease Hi-mediated deletions (Henikoff (1984) 
Gene 28, 351-359; Pharmacia Exonuclease III deletion kit). DNA sequences 
were analysed on a Micro-VAX computer using the Wisconsin sequence 
analysis package (UWGCG; Devereux et al (1984) Nucl. Acids Res. 12, 387- 
395). 



RACE PCR: RACE PCR was carried out essentially as described previously 
(Frohman etal (1988) Proc. Natl. Acad. Sci. USA 85, 8998-9002; Harvey and 
Garlison (1991) Nucl. Acids Res. 19, 4002). In brief, first-strand cDNA 
primed with random hexamers (Amersham) was synthesised from 1 fig of 

15 SGBAF-1 cell mRNA using the Stratagene first-strand cDNA synthesis kit. 
First-strand cDNA was isolated by isopropanol precipitation and tailed with 
oligo-(dA) using terminal deoxynucleotidyl transferase (Bethesda Research 
Laboratories). PCR was performed using oligo 2224 (5'- 
AATTCACACACTGGCATGCCGAT; SEQ ID No 12) and adaptor dT (5'- 

20 GACTCGAGTCGACATCGA 1 1 1 1 1 1 1 11 ITl ' lTl ' lT ; SEQ ID No 13) as 
primers, using a Perkin-Elmer Cetus Taq polymerase PCR kit (conditions: 30 
cycles of 94°C for 1 min, 35°C for 1 min, 72°C for 2 min). Products were 
fractionated on a 1 .5 % low melting point agarose gel and visualised by staining 
with ethidium bromide. The gel was sliced into six bands (ranging from 150 

25 bp to 2000 bp), and DNA was isolated from each gel slice. A further round 
of PCR was performed on this DNA using oligonucleotide 2280 (5- 
TTTAAGCTTAGGCATTCTAAAGTCACTATCATCCC; SEQ ID No 14) and 
adaptor (5'-GACTCGAGTCGACATCGA; SEQ ID No 15) as primers 
(conditions: 35 cycles of 94°C for 1 min, 56°C for 1 min, 72°C for 2 min). 

30 Products were fractionated on an agarose gel and visualised by staining with 
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ethidium bromide. A band 250 bp shorter than the size of the DNA in the gel 
slice used for the PCR was expected. An intensely staining band of 350 bp 
obtained from the -600 bp gel slice was excised, digested with HindUl and 
Sail, and ligated into Bluescript KS- digested with HindUl and Xhol to give 
plasmid pBS/race. Two independent inserts were completely sequenced. The 
sequence of pi 10, the 110-kD catalytic subunit of PI34rinase is shown in 
Figure 1 and has the GenEMBL Accession No M93259 (SEQ ID No 1). 



Isolation of nmt promoter. The promoter has been isolated by Maundrell (2) 
10 and may be isolated by repeating the procedures reported in that reference. 
Moreover, the sequence of the gene, including the promoter, has been 
submitted to the GenBank™/EMBL database as Accession No J05493 and is 
shown in Figure 2 (SEQ ID No 3). 



15 Vectors containing the nmt promoter and derivatives of the nmt promoter 
suitable for use in the present invention are described by Basi et al (1992) Yeast 
8, S597 (special issue) and Maundrell (1990). 



The upstream regulatory region and downstream polyadenylation site of nmtl 
20 have been incorporated into two types of 5. pombe/E. coli shuttle vector: 
pREP extrachromosomally replicating plasmids and pRIP integrating plasmids. 
Using either of these constructs thiamine mediated transcriptional regulation can 
be transferred to heterologous coding sequences. 



25 The time course of induction and repression have been studied as a function of 
changes in the intracellular thiamine concentration. Addition of thiamine to 
cells growing in minimal medium results in a rapid rise in the internal thiamine 
from a basal level of around 10 pmoles/10 7 cells to up to 1000 fold this level 
and this is accompanied by repression of nmtl promoter activity. If cells are 

30 then washed and allowed to continue growth in minimal medium, the 
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intracellular thiamine is progressively diluted as the cell mass doubles and 
transcription is reinitiated as the internal thiamine concentration falls below 50 
pmoles/10 7 cells. The time taken to re-activate the nmtl promoter therefore 
depends on the internal thiamine concentration at the time when the cells are 
5 transferred to thiamine free medium. 

Quantitation of promoter strength was assessed using chloramphenicol acetyl 
transferase as a reporter gene. The fully induced nmtl promoter is about 6 fold 
more active than the S. pombe adh promoter and its activity is reduced about 

10 80 fold when cells are grown in repressing conditions. These vectors are 
ideally suited to applications requiring maximal expression of a gene of interest. 
In addition, two modified versions of the promoter with reduced activity have 
been created following an analysis of the effects of TATA box mutations. 
Truncating the wild type TATA box, TATATAAA to AT AAA (the M' series) 

15 or AT (the '8* series) down-regulates transcriptional activity of the nmtl 
promoter by approximately 1 and 2 orders of magnitude respectively (see 
Table). These mutations in the TATA box do not affect thiamine repressibility 
or the site of transcription initiation. 

20 The table below summarises the salient features of some of the vectors which 
have been constructed thus far: 
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b the Ball site is replaced with an Xhol site allowing expression from the 
ATG. 

c in some of the vectors the complementation gene used for selection of 
plasmid uptake has been changed from the LEU2 gene to the sup 3.5 
5 gene which complements the Ade 6.704 mutation or to the URA4 gene. 

The backbone of the plasmid is not altered (ie promoter and stop 
sequence from the nmtl gene, ARS1 and pUC119 backbone). 



Construction of an 5. pombe pllO expression system. A suitable restriction 
10 fragment containing the complete 110 kDa subunit open reading frame and 
flanking sequences is subcloned into the nmt promoter plasmid containing a 
suitable marker gene for selection creating an nmf-100 plasmid in order to 
allow expression of the 110 kDa protein under the control of the thiamine 
repressible nmt promoter. The nmt-1 10 plasmid is grown in a suitable bacterial 
15 host and the plasmid purified by conventional techniques (Sambrook et al 
1989). A 3.4 kb BamHl/Fspl fragment containing the cDNA of pi 10 was 
isolated and subcloned into the BamHVSmal sites of pREP3X-pl 10 (nmt-1 10). 



The nmt-110 plasmid is then transfected by standard procedures (Moreno, Klar 
20 et al 1991) into a Schizosaccharomyces pombe strain that is auxotrophic for 
leucine cells are transformed using electroporation. Transfected cells are then 
plated in the presence of thiamine and in the absence of leucine. As an 
alternative Schizosaccharomyces pombe strains which are auxotrophic for 
adenine or uracil (that is Ade" or Ura") may be used; in this case the cells are 
25 plated in the presence of thiamine and absence of adenine, or the presence of 
thiamine and absence of uracil, respectively. Colonies growing up under these 
conditions are then analysed for the presence of the nmt-110 plasmid. The 
lethal phenotype caused by the expression of 110 kDa protein is checked by 
replating colonies in the presence or absence of thiamine; under the latter 
30 conditions colonies will arrest and/or die. 
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For the purposes of setting up a screen for inhibitors, a stable transformant is 
isolated. This is carried out by standard procedures involving growth in the 
presence and absence of the selectable marker leucine (or adenine or uracil). 
Isolates obtained in this manner are checked for the stable insertion of the 1 10 

5 kDa sequence into genomic DNA by Southern analysis or stable replication of 
a non-integrated plasmid. Expression of the pi 10 protein is also confirmed by 
western blot analysis of the transformants using antibodies reactive against 
pi 10, or by measuring the activity of the pi 10 subunit in the transformed cells. 
The inducible lethal phenotype is rechecked by growth of these isolates in the 

10 presence and absence of thiamine (> 10 nM). 



It is preferred if 100 nM, or > 1 pM or > 1 /xM is used. 
It is most preferred if 15 fiM thiamine is used. 

15 

Operating the screen. The screen for inhibitor activity is carried out on a 96- 
well microtitre plate format. An integrant colony is picked and put into liquid 
culture in minimal medium, 2% glucose, 15 /xM thiamine and supplements 
appropriate for the strain (eg uracil 50 pgl r ml would be included for a ura" 

20 strain if the integrated plasmid did not harbour a URA4-based selection 
marker). This culture is grown up and, after extensive washing, used to seed 
two 10 ml cultures, one containing thiamine as above, and one without. The 
cultures are expanded overnight and then diluted to an optical density (OD) at 
595 nm of 0.01-0.10. For those cells requiring treatment for arrest of growth 

25 additions are made at this stage prior to plating. The diluted cultures are then 
aliquoted into wells of a sterile 96 well microtitre plate containing individual 
test compounds in the presence or absence of thiamine. The growth of the cells 
is monitored over time until the OD 595 reached is —0.8 for control cultures. 
Control cultures are those cultured with thiamine. The OD 595 is assessed using 

30 a microtitre plate reader. 
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20 
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The cells precultured in thiamine and retained in thiamine serve to indicate 
optimum growth rate. Cells precultured in the absence of thiamine and then 
put into wells containing thiamine provide a control for the rescue of growth. 
Cells precultured in the absence of thiamine and put into wells in the absence 
of thiamine or test compound provide a baseline for non-growth. Individual 
test compounds are assessed for their potency in permitting growth in the 
absence of thiamine in cells plated in the absence of thiamine. 

Accumulated experience in the operation of this screen for a particular gene 
product permits a less frequent monitoring of the growth curves and a single 
time point may be found to be sufficient. Similarly, cultures propagated 
throughout in the presence of thiamine may be found to be a non-essential 
control. These alterations to the procedure may provide some practical 
advantages in increasing the number of test compounds per 96 well plate and 
in reducing the time required for assessment of growth. 

The above procedures have been employed in creating an S. pombe strain 
harbouring a pi 10 cDNA under the control of the nmt promoter. Switching 
these cells from a medium containing thiamine (15 /iM) to one in the absence 
of thiamine causes growth arrest. Evidence that the arrest is a consequence of 
the expression of the mammalian protein has come from a number of 
observations: 

1 . Transient transfection and subsequent expression has been observed on 
multiple occasions with the pi 10 cDNA and not with the vector alone. 

2. On expression of the pi 10 protein, it is possible to detect the activity 
of the expressed mammalian protein in cell extracts, ie the catalytic 
activity is retained on expression in S. pombe. 



30 



PCI7GB93/01651 

On expression of the mammalian regulatory subunit of the kinase, p85or 
[4], increased expression of pi 10 no longer induces growth arrest. 

The use of this system as a viable tool for screening pi 10 inhibitors is 
5 evidenced by the ability of p85a, the regulatory subunit, to suppress the growth 
arrest phenotype. Biochemical evidence has already established that the p85a- 
pllO complex is less active than the free pllO protein [9]. 

The lethal effect of pi 10 expression in 5. pombe is suppressed by p85 
expression as shown in Figure 7. Stable pi 10-expressing S. pombe cells were 
transformed with the pREP4 vector, or the pREP4-p85a or pREP4-p850 
constructs and, after selection for plasmid uptake, were streaked onto selective 
minimal medium plates in the presence or absence of thiamine. Expression of 
pi 10 alone is lethal but this effect is rescued by co-expression of either p85a 
or p85j3. 

The p85a and p850 cDNAs can be obtained using the methods described by 
Otsu et al (1991) Cell 65, 91-104 incorporated herein by reference, 

20 Example 2: Isotvpe-specific effects of PKC expression in S. pombe and the 
effect of PKC expression on growth rates in liquid culture 

5. pombe strains containing integrated plasmids for expression of mammalian 
PKC-7, -5, -e, -f or -77 were streaked onto selective minimal medium plates in 

25 the absence of thiamine or the presence of thiamine or TPA as shown in Figure 
8. Growth of control (vector) or PKC-f cells was similar under all three 
conditions. PKC-7 expression (Figure 8, plate B) marginally decreased growth 
and TPA addition to these cells totally supressed growth (Figure 8, plate C). 
PKC-5, -6 and -ij expression alone was markedly growth inhibitory (Figure 8, 

30 plate B). 



WO 94/03609 



3. 



10 



15 



WO 94/03609 PCT/GB93/01651 



• 




23 

Stable PKC-S. pombe strains were cultured in minimal medium in the absence 
of thiamine for 18 hours until an OD 595 of 0.2-0.5 was attained (see Figure 9). 
Strains were then (at time zero) diluted to an OD 595 of 0,02 in minimal medium 
and cultured in the presence of 1 fiM thiamine (controls) (a), in the absence 

5 of thiamine (■) or in the absence of thiamine with 100 ng/ml TPA (O). At 
the indicated times, the cell density was calculated by measuring the OD 595 . 
PKC-f cells grew at a rate essentially indistinguishable from vector controls. 
PKC-5, -e and -rj expression markedly delayed growth when compared with 
vector controls (-thiamine). Growth of PKC-7, -6 and -77 expressing cells was 

10 essentially nil when cultured in the presence of TPA. 



Example 3: An inhibitor screen for, protein kinase 



Protein kinase C-e [10] cDNA (Figure 3; SEQ ID No 4) has been introduced 
15 into a plasmid under the control of the rant promoter yielding nm/-PKC-e. A 
2.7 Id) Xhol fragment with the full coding sequence for PKG* was isolated 
from pMT2-PKC-e and subcloned into SaZI-digested pREP3X. Then 300 bp of 
5' non-coding sequence was removed by digesting with Xhol and Ncol, blunting 
the ends and religating to give pREP3X-PKC-€. The plasmid pMT2-PKC-e can 
20 be prepared by the methods described by Schaap et al (1989) FEES Lett. 243, 
351-357. Transfection of this construct into 5. pombe employing selection for 
uptake of the LEU2 gene in the presence of thiamine, yields populations of 
cells that on switching to "no thiamine" conditions while retaining selection for 
LEU2, reduce growth rate. 

25 

Growth inhibition is consistent with the expression of the mammalian PKC-e 
gene product since: 



1 . Growth inhibition correlates with an induction of the PKC-e protein as 
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judged by Western analysis. 



2. The induced phenotype also correlates with expression of PKC-€ 
activity as determined in cell extracts. 

3 . Suppression of PKC-e expression by exposure to the phorbol ester TPA 
can rescue cells that are expressing low levels of PKC-€ (cells 
expressing high levels of PKC-€ are not rescued and the steady state 
level of PKC-e is not significantly depressed by TPA treatment). 

The expression of a functional PKC-e activity in S. pombe and its correlation 
with growth arrest under various growth conditions provides the basis for an 
inhibitor screen. 



15 The transformed cells are plated in the presence of thiamine (control) and the 
absence of thiamine (test) and the compound to be assayed is added to the 
"test" plates. 

Example 4: An inhibitor screen for pr otein kinase Ov. 

20 

A cDNA for PKC-7 (Figure 4; SEQ ID No 5) has been introduced into a 
plasmid under the control of the rant promoter, producing /wtf-PKC-7. A 2.4 
kb BamHI/blunt HindUl fragment with the full coding sequence of PKC-7 was 
isolated from pSP64-PKC-7 and subcloned into the BamlWSmal sites of 

25 pREP3X to give pREP3X-PKC-7. The plasmid pSP64-PKC-7 can be prepared 
as described by Patel & Stabel (1989) Cell. Signall. 1, 227-240.Transfection 
of S. pombe with wnr-PKC-7 yields populations of cells that on switching to 
medium without thiamine induce PKC-7 protein as determined by Western 
blotting and the detection of PKC activity in cell extracts. These cells continue 

30 to grow on induction but if the PKC-7 is selectively activated by inclusion of 
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the phorbol ester TPA in the growth medium, the cells will arrest. The 
dependence of growth arrest upon the inclusion of TPA provides direct 
evidence that the catalytic function of PKC-7 is responsible for the phenotype. 
No such arrest is observed on treatment of the original S. porribe strain. Other 
5 PKC activators, such as Mezerein, or other phorbol esters or diacylglycerols 
may be used in place of TPA. 



That activation of PKC-7 induces growth arrest provides a screen for inhibition 
of function of this mammalian gene product. 

10 

Operating the screen. The screen for inhibitor activities is carried out on a 
96-well microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 
culture is then diluted in the absence of thiamine to an OD 595 = 0.01 to 0.10. 

15 The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
until parallel wells/plates containing cells growing in the presence of thiamine 
(15 fM) have increased their OD 595 to 1 .0 units. Cells from the test wells that 

20 have proliferated can be scored relative to both control wells (eg +thiamine) 
and no addition wells (-inhibitor, -thiamine). 

Thus, for PKC-y there are the following possibilities: (i) control plates which 
are + thiamine or -thiamine or -thiamine + TPA and (ii) test plates which are 
25 + thiamine + compound or -thiamine + compound or -thiamine + TPA + 
compound. 



* 
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Example 5: An inhibitor screen for protein kinase C-8 



A cDNA for PKC-5 (Figure 5; SEQ ID No 6) has been introduced into a 
plasmid under the control of the nmt promoter, producing nmr-PKC-6. A 2.4 

5 kb blunt PfMVNdel fragment containing the full coding sequence of PKC-5 
was isolated from pBluescript-PKC-5 and subcloned into blunt Sa/I-digested 
pREP3X to give pREP3X-PKC-5. The plasmid pBluescript-PKC-5 can be 
obtained using the methods described in Olivier & Parker (1991) Eur. J. 
Biochem. 200, 805-810 incorporated herein by reference. Transfection of S. 

10 pombe with nmr-PKC-5 yields populations of cells that on switching to medium 
without thiamine induce PKC-5 protein as determined by Western blotting and 
by activity measurements. There is marked growth inhibition by expression 
alone and if the PKC-5 is activated by inclusion of the phorbol ester TP A in the 
growth medium, the phenotype is strengthened. Experiments with PKC-5 also 

15 provide firm evidence that the phenotype is a result of the function of the 
kinase. Part of the kinase domain of PKC-5 was deleted thus rendering it 
enzymatically inactive. The product was expressed to a high level in 5. pombe 
but there was no growth inhibition thus indicating that the phenotype is due to 
the functional kinase. 

20 

That activation of PKC-5 induces growth inhibition provides a screen for 
inhibition of function of this mammalian gene product. 

Operating the screen. The screen for inhibitor activities is carried out on a 
25 96-well microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 
culture is then diluted in the absence of thiamine to an OD 595 = 0.01 to 0.10. 
The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
30 monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
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until parallel wells/plates containing cells growing in the presence of thiamine 
(15 fjM) have increased their OD 595 to 1 .0 units. Cells from the test wells that 
have proliferated can be scored relative to both control wells (+thiamine) and 
no addition wells (-inhibitor, -thiamine). Additionally, the test wells may 
5 contain or lack TP A. 

Figure 10 shows that the PKC-5-induced growth inhibition is the result of 
kinase activity. S. pombe cells were transformed with a control vector or 
vectors to express the full length PKC-5 protein or a PKC-5 protein in which 

10 part of the catalytic domain has been deleted to render it functionally inactive- 
as a protein kinase (PKC-5A). After selection for uptake of plasmid, a number 
of colonies were plated onto selective medium plates in the presence of 
thiamine, the absence of thiamine or the presence of TP A. PKC-5 expression 
markedly inhibits growth (-thiamine plate) and addition of TP A increases die 

15 effect. In contrast, expression of PKC-5 A has no effect on growth under any 
condition. 



Example 6: An inhibitor screen for protein kinase Cry . 



20 A cDNA for PKC-rj (Figure 6; SEQ ID No 7) has been introduced into a 
plasmid under the control of the nmt promoter, producing nmt-PKC-rj. A 3.3 
kb Xhol fragment containing the coding sequence for PKC-ij was isolated from 
pBluescript-PKC-77 and subcloned into Sa/I-digested pREP3X to give pREP3X- 
PKC-ij. The plasmid pBluescript-PKC-77 can be obtained using the methods 

25 described by Dekker et al (1992) FEBSLett. 312, 195-199. Transfection of S. 
pombe with nmr-PKC-7j yields populations of cells that on switching to medium 
without thiamine induce PKC-17 protein as determined by Western blotting and 
the detection of PKC activity in cell extracts. However, there is some 
expression even in the presence of thiamine which produces —50% growth 

30 inhibition. There is an even more marked growth inhibition by derepressed 
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expression alone and if the PKC-77 is selectively activated by inclusion of the 
phorbol ester TP A in the growth medium, there is no growth. 

That activation of PKC-tj induces growth inhibition provides a screen for 
5 inhibition of function of this mammalian gene product. 

Operating the screen. The screen for inhibitor activities is carried out on a 
96-well microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 

10 culture is then diluted in the absence of thiamine to an OD 595 = 0.01 to 0.10. 
The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
until parallel wells/plates containing cells growing in the presence of thiamine 

15 (15 fiM) have increased their OD 595 to 1 .0 units. Cells from the test wells that 
have proliferated can be scored relative to both control wells (+ thiamine) and 
no addition wells (-inhibitor, -thiamine). Additionally, the test wells may 
contain or lack TP A. 



20 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Imperial Cancer Research Technology Ltd 

(B) STREET: Sardinia House, Sardinia Street 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(P) POSTAL CODE (ZIP): HC2A 3NL 

(G) TELEPHONE: 071 242 1136 

(H) TELEFAX: 071 831 4991 

<ii) TITLE OF INVENTION: Transformed cells and assays using them 
(iii) NUMBER OF SEQUENCES: 15 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .3204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG CCT CCA AGA CCA TCA TCA GGT GAA CTG TGG GGC ATC CAC TTG ATG 48 
Met Pro Pro Arg Pro Ser Ser Gly Glu Leu Trp Gly lie His Leu Met 
1 5 10 15 

CCC CCA AGA ATC CTA GTA GAA TGT TTA CTA CCA AAT GGG ATG ATA GTG 96 
Pro Pro Arg lie Leu Val Glu Cys Leu Leu Pro Asn Gly Met lie Val 

20 25 30 

ACT TTA GAA TGC CTC CGT GAG GCT ACG TTA ATA AOG ATA AAG CAT GAA 144 
Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu He Thr He Lys His Glu 
35 40 45 

CTA TTT AAA GAA GCA AGA AAA TAC CCT CTC CAT CAA CTT CTT CAA GAT 192 
Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 



GAA TCT TCT TAC ATT TTC GTA AGT GTT ACC CAA GAA GCA GAA AGG GAA 
Glu Ser Ser Tyr He Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 



240 
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65 



70 



75 



60 



GAA TTT TTT GAT GAA ACA AGA CGA CTT TGT GAC CTT CGG CTT TTT CAA 
Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 

85 90 95 



288 



CCC TTT TTA AAA GTA ATT GAA CCA GTA GGC AAC CGT GAA GAA AAG ATC 
Pro Phe Leu Lys Val IXe Glu Pro Val Gly Asn Arg Glu Glu Lye lie 

100 105 110 

CTC AAT CGA GAA ATT GGT TTT GCT ATC GGC ATG CCA GTG TGT GAA TTC 
Leu Asn Arg Glu lie Gly Phe Ala lie Gly Met Pro Val Cys Glu Phe 
115 120 125 



336 



384 



GAT ATG GTT AAA GAT CCA GAA GTA CAG GAC TTC CGA AGA AAT ATT CTC 
Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn lie Leu 
130 135 140 

AAT GTT TGT AAA GAA GCT CTC GAT CTT AGG GAT CTT AAT TCA CCT CAT 
Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

AGT AGA GCA ATG TAT GTT TAT CCT CCA AAT GTA GAA TCT TCA CCA GAA 
Ser Arg Ala Met Tyr Val Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 

165 170 175 

CTG CCA AAG CAC ATA TAT AAT AAA TTG GAT AAA GGG CAA ATA ATA GTG 
Leu Pro Lys His He Tyr Asn Lys Leu Asp Lys Gly Gin He He Val 

180 185 190 

GTG ATT TGG GTA ATA GTT TCT CCA AAT AAT GAC AAA CAG AAG TAT ACT 
Val He Trp Val He Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 

CTG AAA ATC AAC CAT GAC TGT GTG CCA GAA CAA GTA ATT GCT GAA GCA 
Leu Lys He Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 

ATC AGG AAA AAA ACT CGA AGT ATG TTG CTA. TCA TCT GAA CAA CTA AAA 
He Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

CTC TGT GTT TTA GAA TAT CAG GGC AAG TAT ATT TTA AAA GTG TGT GGA 
Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr He Leu Lyc Val Cys Gly 

245 250 255 

TGT GAT GAA TAC TTC CTA GAA AAA TAT CCT CTG AGT CAG TAT AAG TAT 
Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 

260 265 270 

ATA AGA AGC TGT ATA ATG CTT GGG AGG ATG CCC AAT TTG ATG CTG ATG 
He Arg Ser Cys He Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
275 280 285 

GCT AAA GAA AGC CTC TAT TCT CAA CTG CCA ATG GAC TGT TTT ACA ATG 
Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro Met Asp Cys Phe Thr Met 
290 295 300 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



CCA TCA TAT TCC AGA CGC ATC TCC ACA GCT ACG CCA TAT ATG AAT GGA 
Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 
305 310 315 320 

GAA ACA TCT ACA AAA TCC CTT TGG GTT ATA AAT AGT GCA CTC AGA ATA 
Glu Thr Ser Thr Lys Ser Leu Trp Val He Asn Ser Ala Leu Arg He 

325 330 335 



960 



1008 
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AAA ATT CTT TGT GCA ACC TAT GTG AAT GTA AAT ATT CGA GAC ATT GAC 
Lys He Leu Cys Ala Thr Tyr Val Asn Val Asn He Arg Asp He Asp 

340 345 350 

AAG ATT TAT GTT CGA ACA GGT ATC TAC CAT GGA GGA GAA CCC TTA TGT 
Lys He Tyr Val Arg Thr Gly He Tyr His Gly Gly Glu Pro Leu Cys 
355 360 365 

GAT AAT GTG AAC ACT CAA AGA GTA CCT TGT TCC AAT CCC AGG TGG AAT 
Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 

GAA TGG CTG AAT TAC GAT ATA TAC ATT CCT GAT CTT CCT CGT CCT GCT 
Glu Trp Leu Asn Tyr Asp He Tyr He Pro Asp Leu Pro Arg Ala Ala 
385 390 395 400 

CGA CTT TGC CTT TCC ATT TGT TCT GTT AAA GGC CGA AAG GGT GCT AAA 
Arg Leu Cys Leu Ser He cys ser Val Lys Gly Arg Lys Gly Ala Lys 

405 410 415 

GAG GAA CAC TGT CCA TTG GCC TGG GGA AAT ATA AAC TTG TTT GAT TAC 
Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 

420 425 430 

ACA GAT ACT CTA GTA TCT GGA AAA ATG GCT TTG AAT CTT TGG CCA GTA 
Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 

CCT CAT GGA CTA GAA GAT TTG CTG AAC CCT ATT GGT GTT ACT GGA TCA 
Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 460 

AAT CCA AAT AAA GAA ACT CCA TGT TTA GAG TTG GAG TTT GAC TGG TTC 
Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 

AGC AGT GTG GTA AAG TTT CCA GAT ATG TCA GTG ATT GAA GAG CAT GCC 
Ser Ser Val Val Lys Phe Pro Asp Met Ser Val He Glu Glu His Ala 

485 490 495 

AAT TGG TCT GTA TCC CGT GAA GCA GGA TTT AGT TAT TCC CAT GCA GGA 
Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 

500 505 510 

CTG AGT AAC AGA CTA GCT AGA GAC AAT GAA TTA AGA GAA AAT GAT AAA 
Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 

GAA CAG CTC CGA GCA ATT TGT ACA CGA GAT CCT CTA TCT GAA ATC ACT 
Glu Gin Leu Arg Ala He Cys Thr Arg Asp Pro Leu Ser Glu He Thr 
530 535 540 

GAG CAA GAG AAA GAT TTT CTG TGG AGC CAC AGA CAC TAT TGT GTA ACT 
Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545 550 555 560 

ATC CCC GAA ATT CTA CCC AAA TTG CTT CTG TCT GTT AAA TGG AAC TCT 
He Pro Glu He Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 

565 570 575 

AGA GAT GAA GTA GCT CAG ATG TAC TGC TTG GTA AAA GAT TGG CCT CCA 
Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 

580 585 590 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



ATC AAG CCT GAA CAG GCT ATG GAG CTT CTG GAC TGC AAT TAC CCA GAT 
He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 



1824 
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595 600 605 

CCT ATG GTT CGA GGT TTT GCT GTT CGG TGC TTA GAA AAA TAT TTA ACA 1872 
Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 

GAT GAC AAA CTT TCT CAG TAC CTA ATT CAG CTA GTA CAG GTA CTA AAA 1920 
Asp Asp Lys Leu Ser Gin Tyr Leu lie Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

TAT GAA CAG TAT TTG CAT AAC CTG CTT GTG AGA TTT TTA CTC AAA AAA 1968 
Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 

645 650 655 

GCG TTA ACT AAT CAA AGG ATC GGT CAC TTT TTC TTT TGG CAT TTA AAA 2016 
Ala Leu Thr Asn Gin Arg lie Gly His Phe Phe Phe Trp His Leu Lys 

660 665 670 

TCT GAG ATG CAC AAT AAA ACA GTT AGT CAG AGG TTT GGC CTG CTT TTG 2064 
Ser Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 685 

GAG TCC TAT TGC CGT GCA TGT GGG ATG TAT CTG AAG CAC CTT AAT AGG 2112 
Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

CAA GTT GAG GCT ATG GAA AAG CTC ATT AAC TTG ACT GAC ATT CTC AAA 2160 
Gin Val Glu Ala Met Glu Lys Leu lie Asn Leu Thr Asp lie Leu Lys 
705 710 715 720 

CAA GAG AAG AAG GAT GAA ACA CAA AAG GTA CAG ATG AAG TTT TTA GTT 2208 
Gin Glu Lys Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 

725 730 735 

GAG CAA ATG CGG CGA CCA GAT TTC ATG GAT GCT CTC CAG GGC TTT CTG 2256 
Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Phe Leu 

740 745 750 

TCT CCT CTA AAC CCT GCT CAT CAG CTG GGA AAT CTC AGG CTT GAA GAG 2304 
Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Glu Glu 
755 760 765 

TGT CGA ATT ATG TCT TCT GCA AAA AGG CCA CTG TGG TTG AAT TGG GAG 2352 
Cys Arg lie Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

AAC CCA GAC ATC ATG TCA GAA TTA CAC TTT CAG AAC AAT GAG ATC ATC 2400 
Asn Pro Asp lie Met Ser Glu Leu His Phe Gin Asn Asn Glu He He 
785 790 795 800 

TTT AAA AAT GGG GAT GAT TTA CGG CAA GAT ATG CTA ACC CTT CAG ATT 2448 
Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin He 

805 810 815 

ATT CGC ATT ATG GAA AAT ATC TGG CAA AAT CAA GGT CTT GAT CTT CGA 2496 
He Arg He Met Glu Asn He Trp Gin Asn Gin Gly Leu Asp Leu Arg 

820 825 830 

ATG TTA CCT TAT GGA TGT CTG TCA ATC GGT GAC TGT G*£G GGA CTT ATC 2544 
Met Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 



GAG GTG GTG AGA AAT TCT CAC ACT ATA ATG CAG ATT CAG TGT AAA GGA 
Glu Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 855 860 



2592 
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GGC CTG AAA GGT GCA CTG CAG TTT AAC AGC CAC ACA CTC CAT CAG TGG 2640 

Gly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 
865 870 875 880 

CTC AAA GAC AAG AAC AAG GGG GAA ATA TAT GAT GCG GCC ATC GAT TTG 2688 
Leu Lys Asp Lys Asn Lys Gly Glu lie Tyr Asp Ala Ala lie Asp Leu 

885 890 895 

TTT ACA CGA TCA TGT GCT GGA TAT TGT GTT GCC ACC TTC ATT TTG GGA 2736 
Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe lie Leu Gly 

900 905 910 

ATT CGA GAT CGT CAC AAT AGT AAT ATC ATG GTT AAA GAT GAT GGA CAA 2784 
He Gly Asp Arg His Asn Ser Asn He Met Val Lys Asp Asp Gly Gin 
915 920 925 

CTG TTT CAT ATA GAT TTT GGA CAC TTT TTG GAT CAC AAG AAG AAA AAA 2832 
Leu Phe His He Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

TTT GGT TAT AAA CGA GAG CGC GTG CCG TTT GTT TTG ACA CAA GAT TTC 2880 
Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 
945 950 955 960 

TTA ATA GTG ATT AGT AAA GGA GCC CAA GAA TGC ACA AAG ACA AGA GAA 2928 
Leu He Val He Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 

965 970 975 

TTT GAG AGG TTT CAG GAG ATG TGT TAC AAG GCT TAT CTA GCT ATT CGG 2976 
Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 

980 985 990 

CAG CAT GCC AAT CTC TTC ATA AAT CTT TTC TCA ATG ATG CTT GGC TCT 3024 
Gin His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

GGA ATG CCA GAA CTG CAA TCT TTT GAT GAT ATT GCA TAC ATT CGA AAG 3072 
Gly Met Pro Glu Leu Gin Ser Phe Asp Asp He Ala Tyr He Arg Lys 
1010 1015 1020 

ACC CTA GCT TTA GAT AAA ACT GAG CAA GAG GCT TTG GAG TAT TTC ATG 3120 
Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1040 

AAA CAA ATG AAT GAT GCA CAC CAT GGT GGC TGG ACA ACA AAA ATG GAT 3168 
Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 

1045 1050 1055 

TGG ATC TTC CAC ACA ATT AAG CAG CAT GCT TTG AAC TGAAATGATA 3214 
Trp He Phe His Thr He Lys Gin His Ala Leu Asn 

1060 1065 



ACTAAAAGCT 


CAGTATCTGG 


ATTCTACACT 


GCACTGTTAA 


TAACTGTCAA 


CAGGCAAAGA 


3274 


CTGATTGCAT 


AGGAATTGCA 


CAATCCATGA 


ACAGCATTAG 


AATTTACAGC 


AAGAACAGAA 


3334 


ATAAAATACT 


ATATAATTTA 


AATAATGTAA 


ACGCAAACAG 


GGTTTGATAG 


CACTAAACTA 


3394 


GTTCATTTCA 


AAATTAAGCT 


TTAGAATAAT 


GCGCAATTTC 


ATGTTATGCC 


TTAAGTCCAA 


3454 


AAAGGTAAAC 


TTTAAAGATT 


GTTTGTATCT 


TTCCTTTAAA 


AAAA 




3498 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1068 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Pro Arg Pro ser Ser Gly Glu Leu Trp Gly lie His Leu Met 
15 10 15 

Pro Pro Arg lie Leu Val Glu Cys Leu Leu Pro Asn Gly Met lie Val 

20 25 30 

Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu lie Thr lie Lye His Glu 
35 40 45 

Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 

Glu Ser Ser Tyr lie Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 
65 70 75 80 

Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 

85 90 95 

Pro Phe Leu Lys Val lie Glu Pro Val Gly Asn Arg Glu Glu Lys lie 

100 105 110 

Leu Asn Arg Glu lie Gly Phe Ala He Gly Met Pro Val Cys Glu Phe 
115 120 125 

Asp Met Val Lys Asp Pro Glu Val Gin. Asp Phe Arg Arg Asn He Leu 
130 135 140 

Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

Ser Arg Ala Met Tyr Val Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 

165 170 175 

Leu Pro Lys His He Tyr Asn Lys Leu Asp Lys Gly Gin He He Val 

180 185 190 

Val He Trp Val He Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 

Leu Lys He Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 

He Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr He Leu Lys Val Cys Gly 

245 250 255 

Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 

260 265 270 

He Arg Ser Cys He Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
275 280 285 

Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro Met Asp Cys Phe Thr Met 
290 295 300 

Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 



WO 94/03609 



PCT/GB93/01651 





36 



305 



310 



315 



320 



Glu Thr Ser Thr Lys Ser Leu Trp Val lie Asn Ser Ala Leu Arg lie 

325 330 335 

Lys lie Leu Cys Ala Thr Tyr Val Asn Val Asn lie Arg Asp He Asp 

340 345 350 

Lys He Tyr Val Arg Thr Cly He Tyr His Gly Gly Glu Pro Leu Cye 
355 360 365 

Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 



Glu 
385 



Leu Asn Tyr Asp He Tyr He Pro Asp Leu Pro 

390 395 



Ala Ala 
400 



Arg Leu Cys Leu Ser He Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 

405 410 415 

Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 

420 425 430 

Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 

Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 460 

Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 



Val Val Lys Phe 
485 



Asp Met Ser Val He Glu Glu His Ala 
490 495 



Asn Trp 



Ser Val 
500 



Arg Glu Ala Gly Phe Ser Tyr 

505 



His Ala Gly 
510 



Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 

Glu Gin Leu Arg Ala He Cys Thr Arg Asp Pro Leu Ser Glu He Thr 
530 535 540 

Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545 550 555 560 

He Pro Glu He Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 

565 570 575 

Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 

580 585 590 

He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
595 600 605 

Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 

Asp Asp Lys Leu Ser Gin Tyr Leu He Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 

645 650 655 



Ala Leu Thr Asn Gin Arg lie Gly His Phe Phe Phe Trp His Leu Lys 
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660 



665 



670 



Ser 
Glu 

Gin 
705 

Gin 

Glu 

Ser 

Cys 

Asn 
785 

Phe 

He 

Met 

Glu 

Gly 
865 

Leu 

Phe 

He 

Leu 

Phe 
945 

Leu 
Phe 
Gin 
Gly 



Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 665 

Ser Tyr Cy9 Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

Val Glu Ala Met Glu Lys Leu He Asn Leu Thr Asp He Leu Lys 

710 715 720 

Glu Lys Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 

725 730 735 

Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Phe Leu 
740 745 750 

Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Glu Glu 
755 760 765 

Arg He Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

Pro Asp He Met Ser Glu Leu His Phe Gin Asn Asn Glu He He 

790 795 800 

Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin He 

805 810 815 

Arg He Met Glu Asn He Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 

Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 

Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 855 860 

Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 

870 875 880 

Lys Asp Lys Asn Lys Gly Glu He Tyr Asp Ala Ala He Asp Leu 

885 890 895 

Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe He Leu Gly 
900 905 910 

Gly Asp Arg His Asn Ser Asn He Met Val Lys Asp Asp Gly Gin 
915 920 925 

Phe His lie Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 

950 955 960 

He Val lie Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 

965 970 975 

Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 
980 985 990 

His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

Met Pro Glu Leu Gin Ser Phe Asp Asp lie Ala Tyr lie Arg Lys 
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1010 1015 1020 

Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1040 

Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 

1045 1050 1055 

Trp He Phe His Thr lie Lys Gin His Ala Leu Asn 

1060 1065 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2199 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: SCHIZOSACCHAROMYCES POMBE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAAAATCTCA 


ACACATGTGA 


ATGATCAGAA 


AATTATCGCC 


ATAAAAGACA 


GAATAAGTCA 


60 


TCAGCGGTTG 


TTTCATTTCC 


TATATTTTTT 


TTTTATTTTT 


TTATTTTTTA 


ATAAGGGAAA 


120 


ATTTAACGTC 


TAAGGATACA 


GAAGATTGTT 


AGCACATTAA 


AGTAATAAAG 


GCTTAAGTAG 


180 


TAAGTGCCTT 


ACCATGTTAT 


TGTATTTCAA 


AGGACATAAT 


CTAAAATAAT 


AACAATATCA 


240 


TTTCTCACAA 


GTTATTCAAT 


TTT CTTTTTT 


TTTTCTAATA 


ATATCAAGAA 


TGTATTATTT 


300 


GTTTGACATA 


AGTCAACTAA 


TTTATTTAAT 


ATGCTGGATT 


AATCTTGCAG 


ACATGTAAAT 


360 


TAACAAGTTT 


TAGTCAAATA 


ACGTTGAAGT 


TTCAATGAAC 


TCAAATAATT 


TCTCTTTTTT 


420 


TTTATATAAC 


CATATGTCTA 


ATCTGATTTA 


TATTTTCCGC 


AGGATCAACT 


GAAGTTATGA 


480 


CATTTGGATT 


GGATCACTTA 


TAACCTTGGT 


CGCCAAATAA 


TACAAAAATC 


AGCGTTATAA 


540 


AACAAAGAAG 


GTTTTTGTTA 


AGAAATTAAT 


CCTCTTTCTT 


GATAAGAAAG 


TTGAACCGAA 


600 


ATTGCAGATA 


CTGATATATG 


AAAATAATAC 


CCACAATTTT 


GGGAATAGCG 


CAAGCCTCAA 


660 


TTTAAACAAT 


AGGTGAGGAC 


ACATGATAAT 


GACCTCAATG 


ATTGTTAGAA 


GAAAAGAGCC 


720 


TCATTACAAA 


ATCGAAAAAT 


GAATGGTTGG 


GTACAAGTTT 


CCAAAACATG 


GTAAAGTGGA 


780 


CTTTGCGTAT 


GAGACGTAAA 


TAGAAAAAAA 


CACTTGTTAT 


ATGTTTTCTA 


GAATTATTGT 


840 


TGTCTCTTTA 


TGGTTGGATG 


ATGCAAAATA 


GTAATTTCGG 


TTAGTTGCTG 


TAAAACACCA 


900 


CGAGACAAAT 


AGATATGGAT 


ATTTATTAAA 


TCAGGAAAAA 


CGTAACTCTC 


GGCTACTGGA 


960 


TGGTTCAGTC 


ACCCAACGAT 


TACTGGGGAG 


AGAAAACAGG 


GCAAAAGCAA 


AGCTTAAAGG 


1020 


AATCCGATTG 


TCATTCGGCA 


ATGTGCAGCG 


AAACTAAAAA 


CCGGATAATG 


GACCTGTTAA 


1080 
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TCGAAACATT GAAGATATAT AAAGGAAGAG GAATCCTGGC ATATCATCAA TTGAATAAGT 1140 

TGAATTAATT ATTTCAATCT CATTCTCACT TTCTGACTTA TAGTCGCTTT GTTAAATCAT 1200 

AGGAATGTCT CCCTTGCCAG TACTGCTAGG GTTTTTCTTT CAAACTATGG AAGCCCATTC 1260 

AAGCTGCATA TTACGATTTT GTTTTTCGCT TTTAGAAAGT GGTTTAGATG AGATAATAGA 1320 
AAAATTCTTG ATCTCCGACA ACGAGTACTT TTATTTTTTT TGCTAATCAC TTTACTCAAT ' 1380 

ATTAGCTCGA AATCGTAGAA ACGTAGACGG GTGCGGGATA CCGAGTGGTG TAGTTAAGAA 1440 

TTTTTATAAA CCACGTGGCC CAAAAATATG AACCCAAAAC GTTTATACAT GAGTATACTT 1500 

TAAGAAGGCT ATACCCCTTC GTGTTAGATG TAGTTTTAGC TACCCAACCC GAGTCTATGA 1560 
GCTTGACTTC AGATGTAGAA GGCATTAAAT CGTTTTGAAT ATTAATTAAA AAACGATGAA • 1620 

AATTAAATAT TTAAAAGCAA TCATACGCTG AAAATTTAGT GCTGTGGCTA ATCCTTCAAC 1680 

ATGGAAATGC CATAAAAGTG ACTTTGACAA AAAAAAAAGT ATATACAGGT ACTAAACTCA 1740 

TCTACTTCAT TGACTTTGTT TACAGCATGT GGAAGGAGGA ATATTTATTG CTAAATCGTA 1800 

GTTTAACATT CAATAAGTAA TACTATTGAA ATTCGACAAG ATTGGCCGCA TGGATGAAAA 1860 

AGAGGCATTT TGCTTTGGGA GAATTAGTTC AAATTAGAAC TGAAAAAAAA AACTTTACGA 1920 

GGCAAAAATG TCGGATTGAG ATCGTAAAAG TTCGCTCGTC GTCTTTTGCT TTGTGATTGT 1980 

TTTCATGGAT ACATCTTGCT GGATATTTAA ATTTTAGTAC TATGTATAAG ATATTCTATA 2040 

AATGTTTTAT CACCCAAACC TGTTAGCGCC TTCTTAATTC TATTCAATCT GGCTTTTGCT 2100 

CTGAGACTAC TTCTTGGACT TTCACTACTT GTTAGTTATA CGGAATTTGT GTAATTAGAA 2160 

GTGAAATAAT CCTTTCTATT AGTAATGCAA ACAAAAATC 2199 
<2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2707 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CTCGAGCTGA AGAACCAGCG AGGCGGCGAG GCAGCCCCCG CGGCTTGCAG CGGAGGCGAC 60 

AGCTCGTCTC CTGCCGTGGA GGTGTCGCCG GTGGTGGGGG GGAGAGACTT GCTCCAAAAA 120 

AACGGACGTC TCCAGCTCTC CCCCCTCCCT GTTTTCCGTT AGGAATCCGG CGAGGAAATA 180 

CATGCACTCG CTGAGAATCG GCGGCGCCAG GACGCAGCGC CACAAGGTGT AGCGAGTGAG 240 

TGGGGTGGGG CAAGAGGGGA CCCAGGAGTC CCCCAGGCTC CCGGCGCGCC TGCTCCTGCT 300 
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CTTCAATCCT GCCCACGGGG CGGACGGAGT GACCCCCGCC CCGACCATGG TAGTGTTCAA 
TGGCCTTCTT AAGATCAAAA TCTGCGAGGC GGTGAGCTTG AAGCCCACAG CCTGGTCGCT 
GCGCCATGCG GTGGGACCCC GGCCACAGAC GTTCCTTTTG GACCCCTACA TTGCCCTTAA 
CGTGGACGAC TCGCGCATCG GCCAAACAGC CACCAAGCAA AAGACCAACA GCCCGGCCTG 
GCACGATGAG TTCGTCACCG ATGTGTGCAA TGGGCGCAAG ATOGAGCTGG CTGTCTTTCA 
CCACGCTCCT ATCGGCTACG ACGACTTCGT GGCCAACTGC ACCATCCAGT TCGAGGAGCT 
GCTGCAGAAT GGGAGCCGTC ACTTOGAGGA CTGGATTGAC CTGGAGCCAG AAGGAAAAGT 
GTACGTGATC ATCGATCTCT CGGGATCATC GGGTGAAGCC CCTAAAGACA ATGAAGAACG 
AGTGTTCAGG GAGCGTATGC GGCCAAGGAA GCGGCAAGGG GCTGTCAGGC GCAGGGTCCA 
CCAGGTCAAT GGCCACAAGT TCATGGCCAC CTACTTGCGG CAACCCACCT ACTGCTCCCA 
CTGCAGAGAT TTCATCTGGG GTGTCATAGG AAAACAGGGA TATCAATGTC AAGTTTGCAC 
TTGCGTTGTC CACAAGCGAT GTCATGAGCT CATTATTACA AAGTGCGCTG GGCTGAAGAA 
ACAGGAAACC CCTGACGAGG TGGGCTCCCA ACGGTTCAGC GTCAACATGC CCCACAAGTT 
CGGGATCCAC AACTACAAGG TCCCCACGTT CTGTGACCAC TGTGGGTCCC TGCTCTGGGG 
CCTCTTGCGG CAGGGCTTGC AGTGTAAAGT CTGCAAAATG AATGTTCACC GGCGATGTGA 
GACCAACGTG GCTCCCAACT GTGGGGTAGA CGCCAGAGGA ATTGCCAAAG TGCTGGCTGA 
CCTCGGTGTT ACTCCAGACA AAATCACCAA CAGTGGCCAA AGGAGGAAAA AGCTCGCTGC 
TGGTGCTGAG TCCCCACAGC CGCCTTCTGC AAACTCCCCA TCTGAAGACG ACCGATCCAA 
GTCAGCGCOC ACCTCCCCTT GTGACCAGGA ACTAAAAGAA CTTGAAAACA ACATCCGGAA 
GGCCTTGTCA TTTGACAACC GAGGAGAGGA GCACCGAGCG TCGTCGGCCA CCGATGGCCA 
GCTGGCAAGC CCCGGAGAGA ATGGGGAAGT CCGGCCAGGC CAGGCCAAGC GCTTGGGGCT 
GGATGAGTTC AACTTCATCA AAGTGTTGGG CAAAGGCAGC TTTGGCAAGG TCATGTTGGC 
GGAACTCAAA GGCAAAGATG AAGTCTACGC TGTGAAGGTC TTGAAGAAGG ACGTTATCCT 
ACAAGACGAT GATGTGGACT GCACAATGAC AGAGAAGAGG ATTTTGGCTC TGGCTCGGAA 
ACACCCTTAT CTAACCCAAC TCTATTGCTG CTTCCAGACC AAGGACCGCC TCTTCTTCGT 
CATGGAATAT GTAAATGGTG GAGACCTCAT GTTCCAGATT CAGCGGTCCC GAAAATTTGA 
TGAGCCTCGT TCTCGGTTCT ATGCCGCAGA GGTCACATCG GCCCTCATGT TTCTCCACCA 
GCATGGAGTG ATCTACAGGG ATTTGAAACT GGACAACATC CTTCTACATG CAGAAGGCCA 
CTGCAAGCTG GCTGACTTTG GGATGTGCAA GGAAGGGATT ATGAATGGTG TGACAACTAC 
CACCTTCTGT GGGACTCCTG ACT AC AT AG C TCCAGAGATC CTACAGGAGT TGGAGTACGG 
CCCCTCAGTG GACTGGTGGG CCCTGGGGGT GCTGATGTAC GAGATGATGG CTGGGCAGCC 
CCCCTTTGAA GCTGACAACG AGGACGACTT GTTCGAATCC ATCCTTCATG ATGATGTTCT 
CTATCCTGTC TGGCTTAGCA AGGAAGCTGT CAGCATCCTG AAAGCTTTCA TGACCAAGAA 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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CCCGCACAAG CGCCTGGGCT GTGTGGCAGC GCAGAACGGG GAGGACGCCA TCAAGCAACA 
TCCATTCTTC AAGGAGATTG ACTGGGTACT GCTGGAGCAG AAGAAAATCA AGCCCCCCTT 
CAAGCCGAGA ATTAAAACCA AAAGAGATGT CAATAACTTT GACCAAGACT TTACGCGGGA 
AGAGCCAATA CTTACACTTG TGGATGAAGC AATCATTAAG CAGATCAACC AGGAAGAATT 
CAAAGGCTTC TCCTACTTTG GTGAAGACCT GATGCCCTGA GAGGCTGCTT CGGATGGAGG 
GAGCTCATGC TGCAAGGACG GTGTTGAGAT ACTCCCAAGC TGCAGAGGCT CCGAAGGTCT 
CAACTCCTCC TCCTCCTCCC CCTCCCCAGA GCCCCAGTCC CATGTCCACT CTCTTATTTA 
TTGCATT 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to roRNA 
(iii) HYPOTHETICAL: NO 



2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2707 



(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GGCCCCTGTT 


CTGCAGAAAG 


GGGGCTCTGA 


GGCAGAAGGT 


GGTCCATGAG 


GTCAAGAGCC 


60 


ACAAGTTCAC 


CGCTCGCTTC 


TTCAAGCAGC 


CGACCTTCTG 


CAGCCACTGC 


ACTGACTTCA 


120 


TATGGGGGAT 


TGGAAAACAG 


GGTCTGCAAT 


GTCAAGTCTG 


CAGTTTTGTG 


GTTCATCGAC 


180 


GATGCCACGA 


GTTTGTGACC 


TTCGAGTGTC 


CAGGCGCTGG 


GAAGGGCCCC 


CAGACGGACG 


240 


ATCCCCGGAA 


CAAGCACAAG 


TTCCGTCTGC 


ACAGCTACAG 


CAGCCCCACC 


TTCTGCGACC 


300 


ACTGTGGCTC 


CCTGCTCTAC 


GGGCTGGTGC 


ACCAGGGCAT 


GAAGTGTTCT 


TGCTGCGAGA 


360 


TGAACGTGCA 


CCGGCGCTGT 


GTGCGCAGCG 


TGCCCTCTCT 


GTGCGGCGTG 


GACCACACGG 


420 


AGCGCCGGGG 


CCGCCTGCAG 


CTGGAGATCC 


GGGCGCCCAC 


TTCCGATGAG 


ATCCACGTTA 


480 


CGGTTGGCGA 


GGCCCGGAAC 


CTCATCCCAA 


TGGACCCCAA 


CGGTCTCTCC 


GATCCCTATG 


540 


TGAAGCTGAA 


GCTCATCCCA 


GACCCTCGGA 


ATTTGACCAA 


GCAGAAGACC 


CGCACGGTGA 


600 


AAGCTACGCT 


AAACCCTGTG 


TGGAACGAGA 


CCTTTGTGTT 


CAACCTGAAG 


CCGGGGGACG 


660 


TGGAGCGCCG 


GCTCAGCGTG 


GAGGTGTGGG 


ACTGGGACCG 


GACCTCCCGA 


AACGACTTCA 


720 


TGGGCGCCAT 


GTCCTTCGGC 


GTCTCGGAGC 


TGCTCAAGGC 


GCCGGTGGAC 


GGCTGGTACA 


780 


AGTTACTGAA 


CCAGGAGGAG 


GGCGAGTATT 


ACAATGTGCC 


GGTGGCTGAC 


GCCGACAACT 


840 


GCAACCTCCT 


CCAGAAGTTC 


GAGGCCTGTA 


ACTACCCCCT 


GGAACTATAC 


GAGAGGGTGC 


900 


GGACGGGTCC 


CTCTTCATCT 


CCCATCCCCT 


CCCCATCCCC 


CAGTCCCACC 


GACTCCAAGC 


960 
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GCTGTTTCTT 


CGGGGCCAGC 


CCTGGACGAC 


TGCACATCTC 


CGACTTCAGC 


TTCCTCATGG 


1020 


TTCTAGGAAA 


AGGCAGTTTT 


GGGAAGGTGA 


TGCTGGCCGA 


GCGCCGGGGC 


TCCGATGAGC 


1080 


TCTACGCCAT 


CAAGATCCTG 


AAGAAAGACG 


TGATCGTCCA 


GGATGACGAC 


GTGGAC7GCA 


1140 


CCCTGGTGGA 


GAAACGCGTG 


CTGGCTCTGG 


GGGGCCGAGG 


CCCGGGAGGC 


CGGCCGCACT 


1200 


TCCTCACCCA 


GCTTCACTCC 


ACCTTCCAGA 


CCCCGGATCG 


CCTGTATTTT 


GTGATGGAGT 


1260 


ATGTCACCGG 


GGGCGACTTG 


ATGTACCACA 


TTCAACAGCT 


GGGCAAGTTT 


AAGGAACCCC 


1320 


ACGCAGCGTT 


CTACGCTGCA 


GAAATCGCCA 


TCGGCCTCTT 


CTTCCTCCAT 


AACCAGGGCA 


1380 


TTATCTATCG 


GGACCTGAAA 


CTGGACAACG 


TGATGCTGGA 


TGCCGAAGGA 


CACATCAAAA 


1440 


TCACCGACTT 


CGGCATGTGT 


AAGGAGAACG 


TCTTTCCCGG 


GAGTACCACT 


CGCACCTTCT 


1500 


GCGGGACCCC 


GGACTACATA 


GCCCCCGAGA 


TCATTGCCTA 


CCAACCCTAT 


GGGAAGTCTG 


1560 


TGGATTGGTG 


GTCCTTTGGG 


GTTCTGCTCT 


ACGAGATGTT 


GGCAGGACAG 


CCCCCCTTTG 


1620 


ATGGAGAAGA 


TGAGGAGGAG 


CTGTTTCAAG 


CCATCATGGA 


ACAAACTGTC 


ACCTACCCCA 


1680 


AGTCGCTTTC 

X>V A Vv w X A A W 


CCGGGAAGCT 


G TGGC CATCT 


GCAAGGGGTT 


CCTGACCAAG 


CACCCGGCCA 


1740 


AGCGCCTGGG 


CTCAGG CCCC 

w a wn w ww ww 


GATGGAGAGC 


CCACCATCCG 


CGCTCACGGC 


TTTTTCCGCT 


1800 


GG AT CG A CTG 

wr» «i wwnwj. w 


GGACAGGCTG 


GAACGATTAG 


AGATCGCGCC 


TCCGTTCAGA 


CCCCGCCCGT 


1860 


GTGGCCGCAG 


CG GCG AG AAC 


TTCGACAAGT 


TCTTCACTCG 


GGCGG CGCCG 


GCGCTGACAC 


1920 


CCCCTGACCG 


CCTGGTTCTG 


GCCAGCATCG 


ACCAGGCTGA 


GTTCCAGGGC 


TTCACCTATG 


1980 


TCAACCCGGA 


TTTCGTGCAC 


CCGGATGCCC 


GCAGCCCCAT 


CAGCCGAACG 


CCTGTCCCAG 


2040 


TCATGTAATC 


CCACCTGCCG 


CCACCAGGCG 


TCCCCACGGC 


TCCCTCCTCC 


GCCCCGGCTT 


2100 


TGGCCCTCGC 


CTCACCATGC 


CACCCGCCTT 


TCCAATTCTA 


GATATGGCTC 


CCCAGCGTTC 


2160 


TGGCCTC 












2167 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGCGGCGGCC GCGGGGATCC CGCGAGCGGC CCCTGAACAT CTACCCTTCT TGCCGGGACC 60 

CGGGAGGTCC CCACTGGCCT CCGGGCCCGT CCTGATCAGA CTCGTGTCGA CCTCCCCGTC 120 

CACGCGCATC CGGGAGAGCC GCGCCACGAG ACGGACCCGG GCCCGCCGGG ACCCCTGGTG 180 
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TCTGGCCCTG 


CGTCGAGAGG 


CTGGTGACTG 


CCACCCATAA 


GCTCCAGCTT 


CAGCCTCGGC 


240 


TTACTCCCCT 


CAGGGGCTTG 


CAGGCTGAGG 


CCTGCCCTCG 


GACGCGGCTG 


ACCAGCCTCT 


300 


CCCTCTCTTC 


CACACTTTGG 


ACTTCTCTTT 


GGACCTCCTA 


AAAAGGCTCC 


ATCATGGCAC 


360 


CGTTCCTGCG 


CATCTCCTTC 


AATTCCTATG 


AGCTGGGCTC 


CCTGCAGGCG 


GAGGACGACG 


420 


CAAGCCAGCC 


TTTCTGTGCC 


GTGAAGATGA 


AGGAGGCACT 


CACCAGAGAC 


CGAGGGAAGA 


480 


CTCTGGTACA 


GAAGAAGCCC 


ACAATGTACC 


CTGAGTGGAA 


GTCAACATTC 


GACGCCCACA 


540 


TCTATGAAGG 


CCGTGTCATC 


CAGATCGTGC 


TGATGCGGGC AGCTGAAGAC 


CCCATGTOGG 


600 


AGGTGACCGT 


GGGCGTGTCA 


GTGCTGGCTG 


AGCGCTGCAA 


GAAGAACAAC 


GGCAAGGCTG 


660 


AGTTCTGGCT 


GGACCTGCAG 


CCTCAGGCCA 


AGGTGCTGAT 


GTGTGTGCAG 


TATTTCCTGG 


720 


AGGATGGGGA 


TTGCAAACAG 


TCCATGCGTA 


GTGAGGAGGA 


GGCCATGTTC 


CCAACTATGA 


780 


ACOGCCGTGG 


AGCCATTAAA 


CAGGCGAAGA 


TTCACTACAT 


CAAGAACGAC 


GAGTTCATCG 


840 


CCACCTTCTT 


TGGGCAGCCC 


ACCTTCTGTT 


CTGTGTGCAA 


AGAGTTTGTC 


TGGGGCCTCA 


900 


ACAAGCAAGG 


CTACAAATGC 


AGGCAATGCA 


ACGCTGCCAT 


CCATAAGAAA 


TGCATCGACA 


960 


AGATTATCGG 


CCGCTGCACT 


GGCACTGCTA 


CCAATAGCCG 


GGACACCATC 


TTCCAGAAAG 


1020 


AACGCTTCAA 


CATCGACATG 


CCTCACCGAT 


TCAAGGTCTA 


TAACTACATG 


AGCCCCACCT 


1080 


TCTGTGACCA 


CTGTGGCACT 


TTGCTCTGGG 


GATTGGTGAA 


ACAGGGATTA 


AAGTGTGAAG 


1140 


ACTGCGGCAT 


GAATGTGCAC 


CACAAATGCC 


GGGAGAAGGT 


GGCCAACCTG 


TGTGGTATCA 


1200 


ACCAAAAGCT 


CTTGGCTGAG 


GCCTTGAACC 


AAGTGACCCA 


GAAAGCTTCC 


CGGAAGCCAG 


1260 


AGACACCAGA 


GACTGTCGGA 


ATATACCACG 


GATTCGAGAA 


GAAGACAGCT 


GTCTCTGGGA 


1320 


ATGACATCCC 


AGACAACAAC 


GGGACCTATG 


GCAAGATCTG 


GGAGGGGAGC 


AACCGGTGCC 


1380 


GCCTTGAGAA 


CTTCACCTTC 


CAGAAAGTAC 


TTGGCAAAGG 


CAGCTTTGGC 


AAGGTACTGC 


1440 


TTGCAGAACT 


GAAGGGCAAG 


GAAAGGTACT 


TTGCAATCAA 


GTACCTGAAG 


AAGGACGTGG 


1500 


TGTTGATCGA 


CGATGACGTG 


GAGTGCACCA 


TGGTGGAGAA 


GCGGGTGCTG 


GCGCTCGCCT 


1560 


GGGAGAATCC 


CTTCCTCACC 


CATCTCATCT 


GTACCTTCCA 


GACCAAGGAC 


CACCTCTTCT 


1620 


TTGTGATGGA 


GTTCCTCAAT 


GGGGGCGATC 


TGATGTTCCA 


CATTCAGGAC 


AAAGGCCGCT 


1680 


TCGAACTCTA 


CCGGGCTACG 


TTTTATGCAG 


CTGAGATCAT 


CTCCGGACTG 


CAGTTTCTAC 


1740 


ATGGCAAAGG 


CATCATTTAC 


AGGGACCTCA 


AGCTAGACAA 


TGTAATGCTG 


GACAAGGATG 


1800 


GCCACATCAA 


GATTGCTGAC 


TTCGGGATGT 


GCAAAGAGAA 


TATATTTGGG 


GAGAACCGGG 


1860 


CCAGCACATT 


CTGCGGCACT 


CCTGACTACA 


TCGCCCCTGA 


GATCCTGCAG 


GGCCTGAAGT 


1920 


ACTCATTTTC 


CGTGGACTGG 


TGGTCTTTTG 


GGGTCCTCCT 


CTATGAGATG 


CTCATTGGCC 


1980 


AGTCCCCCTT 


CCATGGTGAT 


GATGAGGACG 


AGCTCTTTGA 


GTCCATCCGG 


GTGGACACAC 

* 


2040 


CACACTACCC 


GCGCTGGATC 


ACCAAGGAGT 


CCAAGGACAT 


CATGGAGAAG 


CTCTTCGAGA 


2100 


GGGACCCTGC 


CAAGAGGCTG 


GGAGTAACAG 


CAAACATCAG 


GCTTCACCCC 


TTTTTCAAGA 


2160 



y/O 94/03609 



PCI7GB93/01651 




44 




CTATCAACTG 


GAACCTGCTG 


GAGAAGCGGA 


AGGTGGAGCC 


CCCCTTTAAG 


CCCAAAGTGA 


2220 


AATCCCCTTC 


AGACTACAGC 


AACTTTGACC 


CAGAGTTCCT 


GAATGAGAAA 


CCCCAACTTT 


2280 


CCTTCAGTGA 


CAAGAACCTC 


ATCGACTCTA 


TGGACCAGAC 


AGCCTTCAAG 


GGCTTCTCCT 


2340 


TTGTGAACCC 


CAAATATGAG 


CAATTCCTGG 


AATAG7GAGC 


TCCCAGACCT 


GCTTTTAATG 


2400 


CCCCGGCAGA 


GTAGGCCCAT 


CTGCCCTGGT 


TTGCATCCTC 


ACTGCCCATG 


AAGAAGAGTG 


2460 


GGTGACTGGT 


GATTCCTGCT 


GCTGCCCCCT 


CTTCCTCGGA 


GAGTCTGGCT 


CCTGTTGGCT 


2520 


GGGCTCACAG 


TACTTCCTCT 


GTGAACTGTT 


TGTGAATTTG 


CCTTCCTTTT 


GCCATCGGAG 


2580 


GGAAACTGTA 


AATCCTGTGT 


GTCATTACTT 


GAATGTAGT7 


ATTGAAATAT 


ATATTATATA 


2640 


TATGCACATA 


TATATAATAG 

mt mm ^» m m m* m mm m » • m^^ 


GCTGTATATA 


TTGCTCAGTA 


TAGAAAGCAT 


GTAGGAGACT 


2700 


GGTGATGTGT 


TGACCTTTTT 


TTAAAAAAAA 


CCATATGTAT 


ACGTGTGTAT 


GTATACATCT 


2760 


ACACACGTAT 


ACATATATGT 


ATGTATGTAT 


GTATGTATGT 


ATGTATATAT 


GACCAAAAGA 


2820 


AAAGAGAGGA 


CAAGCTACCT 


GAACCACAGG 


ATTGTTTATG 


TGTGTATAAA 


TAAACACTGA 


2880 


ATGGTAAAAA 


A 










2891 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
<iii) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TCCGGGTTCC 


CCAGTGCCAG 


CCAGCGCGGC 


CCCCTCGGGG 


CTCCGGCAGC 


AGCGCCGGCA 


60 


TGTCGTCCGG 


CACGATGAAG 


TTCAATGGCT 


ATCTGAGGGT 


CCGCATCGGA 


GAGGCTGTAG 


120 


GGCTGCAGCC 


CACCCGCTGG 


TCCCTGCGGC 


ACTCGCTCTT 


CAAAAAGGGC 


CACCAGCTGC 


160 


TGGACCCCTA 


CCTGACGGTG 


AGCGTAGACC 


AGGTACGCGT 


GGGCCAGACC 


AGCACAAAGC 


240 


AGAAGACCAA 


CAAACCCACC 


TACAACGAGG 


AGTTCTGCGC 


CAATGTCACC 


GACGGCGGCC 


300 


ACCTGGAGCT 


AGCCGTCTTC 


CACGAGACGC 


CCCTGGGTTA 


TGACCACTTT 


GTGGCCAACT 


360 


GCACGCTGCA 


GTTCCAGGAG 


CTGTTGCGCA 


CGGCTGGTAC 


CTCGGACACC 


TTCGAGGGCT 


420 


GGGTGGATCT 


GGAGCCTGAG 


GGGAAAGTGT 


TTGTGGTAAT 


AACCCTAACA 


GGGAGTTTCA 


480 


CTGAAGCCAC 


TCTCCAGAGA 


GACCGCATCT 


TCAAGCATTT 


TACCAGGAAG 


CGCCAAAGGG 


540 


CTATGCGAAG 


ACGAGTCCAT 


CAAGTGAACG 


GACATAAGTT 


CATGGCCACG 


TACCTGAGGC 


600 


AGCCCACCTA 


CTGCTCTCAT 


TGCCGAGAGT 


TCATCTGGGG 


AGTATTTGGG 


AAACAGGGTT 


660 
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ATCAATGCCA 


AGTGTGCACC 


TGCGTCGTCC 


ATAAACGCTG 


CCATCATCTA 

mrm • mm ^m>m m mt mm mm * w 


ATTGTTACAG 

m m. mm mm *> •> • • *rWB mmm 


720 


CCTGCACTTG 


CCAAAACAAT 


ATTAACAAAG 


TGGATGCCAA 


GATTGCAGAA 


CAGCGGTTTG 


780 


GCATCAACAT 


CCCACACAAG 


TTCAACGTTC 


ACAATTACAA 


GGTGCCCACG 


TTCTGTGACC 


840 


ACTGTGGCTC 


CCTGCTCTGG 


GGGATAATGC 


GACAAGGACT 


TCAGTGTAAA 


ATATGTAAGA 


900 


TGAATGTACA 


TATTCGGTGT 


CAGGCGAACG 


TGGCCCCAAA 


CTGCGGGGTG 


AATGCCGTGG 


960 


AGCTTGCCAA 


GACCCTGGCA 


GGGATGGGTC 


TCCAACCCGG 


AAATATTTCT 


CCAACCTCGA 

¥ 


1020 


AACTCATTTC 


CAGGTCGACA 


CTAAGACGGC 


AGGGAAAGGA 


GGGCTCCAAA 


GAAGGAAATG 


1080 


GGATCGGTGT 


TAACTCTTCC 


AGCAGATTCG 


GCATCGACAA 


CTTTGAGTTC 


ATCCGCGTGT 


1140 


TGGGGAAGGG 


GAGCTTCGGG 


AAGGTGATGC 


TTGCCAGGAT 


AAAGGAGACA 


GGAGAACTGT 


1200 


ACGCCGTGAA 


GGTGCTGAAG 


AAGGACGTGA 


TTCTGCAGGA 


TGATGATGTA 


GAGTGCACCA 


1260 


TGACTGAGAA 


GAGGATCCTG 


TCCTTGGCTC 


GCAACCACCC 


CTTCCTCACC 


CAGCTCTTCT 


1320 

mm mW mm ™W 


GCTGCTTTCA 


GACTCCAGAC 


CGTCTGTTCT 


TTGTCATGGA 


GTTTGTGAAC 

m* mm V mm ^m m mm * 


GGAGGCGACC 


1380 

mm- mm* \mf 


TGATGTTCCA 


CATCCAAAAG 


TCCCGTCGTT 


TCGATGAAGC 


CCGTGCTCGT 


TTCTACGCCG 


1440 


CGGAGATCAT 


TTCTGCACTC 


ATGTTCCTAC 


ATGAGAAAGG 


TATCATCTAT 


AGAGACTTGA 


1500 

mm mmf mm \J 


AACTGGACAA 


TGTGCTATTG 


GACCACGAAG 


GTCACTGTAA 


ACTGGCCGAT 


TTTGGAATGT 


1560 

mm mw V mm* 


• 

GCAAGGAGGG 


GATTTGTAAT 


GGGGTCACCA 


CAGCCACCTT 


CTGCGGTACA 

* ^^^^^^ m* « • 


CCTGACTACA 


1620 

mm- V mm W 


TTGCCCCAGA 


GATCCTTCAG 


GAGATGCTGT 


ATGGACCTGC 


AGTAGACTGG 


TGGGCCATGG 


1680 

mm, W %m* V 


GCGTGTTCCT 


TTATGAGATG 


CTGTGCGGAC 


ATGCGCCCTT 


TGAGGCTGAA 


AATGAAGATG 

m mm m mm VfMlVO mm m mJ 


1740 


ACCTTTTTGA 


GGCCATACTG 


AATGATGAAG 


TCGTCTACCC 


C ACCTG GCTC 


CATGAAGATG 


1800 

mm «^ 


CCAGAGGGAT 


CCTCAAGTCT 


TTCATGACCA 


AGAACCCCAC 


CATGCGCTTG 


GGCAGCCTGA 


1860 


CTCAGGGAGG 


AGAGCATGAG 


ATCCTGAGAC 


ACCCTTTCTT 


TAAGGAAATC 


GACTGGGCCC 


1920 




J. ouULnu A 1a 


uauUwvLU IT 


TCCGACCTAG 


AATCAAATCC 


CGAGAAGATG 


1980 


TCAGCAATTT 


TGACCCAGAC 


TTTATAAAAG 


AAGAGCCCGT 


CTTAACTCCG 


ATTGATGAGG 


2040 


GACATCTTCC 


TATGATTAAC 


CAGGATGAGT 


TTAGAAACTT 


TTCCTATGTG 


TCACCGGAAT 


2100 


TGCAACTGTA 


GCCTTATGGG 


GAGTCAGAAC 


CAAAGGGGAA 


GGTGGATTTC 


TCCAGGAATT 


2160 


TCTTATGTGG 


GAATTC 










2176 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Met Asp Trp lie Phe His Thr 
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1 



5 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AARATGGAYT GGATHTTYCA YAC 23 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Asp Gly Gin Leu Phe His lie Asp Phe Gly His Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GATGATGGCC ARCTGTTYCA YATWGAYTTT GGCCAYTT 38 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AATTCACACA CTGGCATGCC GAT 23 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACTCGAGTC GACATCGATT TT TT TTTTTT TTTTT 35 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTTAAG CTTA GGCATTCTAA AGTCACTATC ATCCC 35 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GACTCGAGTC GACATCGA 



18 
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CLAIMS 



1. A eukaryotic cell transformed with a DNA construct comprising a 
coding sequence encoding a polypeptide having the activity of a 

5 mammalian phospholipid kinase or a mammalian protein kinase 

activated by a phospholipid or its metabolite and regulatory elements to 
allow transcription of the coding sequence in the said cell wherein the 
regulatory elements include a repressible or inducible promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to 

10 the cell. 

2. A cell according to Claim 1 wherein the cell is a yeast cell. 

3. A cell according to Claim 2 wherein the yeast is Schizosaccharomyces. 

15 

4. A cell according to Claim 3 wherein the promoter is the runt promoter. 

5. A Schizosaccharomyces cell transformed with a DNA construct 
comprising a coding sequence encoding a polypeptide having the 

20 activity of a mammalian phospholipid kinase or a mammalian protein 

kinase activated by a phospholipid or its metabolite and regulatory 
elements to allow transcription of the coding sequence in the said cell 
wherein the regulatory elements include a constitutive promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to 

25 the cell. 



6. A Schizosaccharomyces cell according to Claim 5 wherein the promoter 
is the adh promoter. 



30 7. A cell according to any one of the preceding claims wherein the 
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phospholipid kinase is an inositol phospholipid kinase. 



15 



20 




8. A cell according to any one of Claims 1 to 6 wherein the protein kinase 
activated by a phospholipid or its metabolite is a protein kinase C. 

5 

9. A cell according to Claim 7 wherein the phospholipid kinase is selected 
from the group consisting of phosphatidyl inositol 3-kinase, 
phosphatidyl inositol 4-kinase and phosphatidyl inositol-5-kinase. 

10 10. A cell according to Claim 9 wherein the phospholipid kinase is 

phosphatidyl inositol 3-kinase. 



11. A cell according to Claim 8 wherein the protein kinase C is selected 
from any one of PKC-7, PKC-S, PKC-ij or PKC-e. 



12. An assay for detecting whether a compound is involved in cell growth 
regulation, the assay comprising (1) a cell according to any one of the 
preceding claims, (2) a container for the said cell, (3) a growth medium 
for the said cell and (4) means to detect the viability of the cell. 

13. A kit comprising a eukaryotic cell as defined in Claim 1 and culture 
medium such that the cell will divide and grow. 



14. A method for assaying for a compound that is involved in cell growth 
25 regulation the method comprising (1) culturing a cell as defined in 

Claim 1, (2) adding a compound and (3) determining the cell growth 
rate in the presence of the compound. 



15. 

30 



A compound identified by the assay of Claim 12 or the method of 
Claim 14 as being involved in cell growth regulation. 
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AAAAATCTCA ACACATGTGA ATGATCAGAA AATTATCGCC ATAAAAGACA GAATAAGTCA 
TCAGCGGTTG TTTCATTTCC TATATTTTTT TTTTATTTTT TTATTTTTTA ATAAGGGAAA 
ATTTAACGTC TAAGGATACA GAAGATTGTT AGCACATTAA AGTAATAAAG GCTTAAGTAG 
TAAGTGCCTT AGCATGTTAT TGTATTTCAA AGGACATAAT CTAAAATAAT AACAATATCA 
TTTCTCACAA GTTATTCAAT TTTCTTTTTT TTTTCTAATA ATATCAAGAA TGTATTATTT 
GTTTGACATA AGTCAACTAA TTTATTTAAT ATGCTGGATT AATCTTGCAG ACATGTAAAT 
TAACAAGTTT TAGTCAAATA ACGTTGAAGT TTCAATGAAC TCAAATAATT TCTCTTTTTT 
TTTATATAAC CATATGTCTA ATCTGATTTA TATTTTCCGC AGGATCAACT GAAGTTATGA 
CATTTGGATT GGATCACTTA TAACCTTGGT CGCCAAATAA TACAAAAATC AGCGTTATAA 
AACAAAGAAG GTTTTTGTTA AGAAATTAAT CCTCTTTCTT GATAAGAAAG TTGAACCGAA 
ATTGCAGATA CTGATATATG AAAATAATAC CCACAATTTT GGGAATAGCG CAAGCCTCAA 
TTTAAACAAT AGGTGAGGAC ACATGATAAT GACCTCAATG ATTGTTAGAA GAAAAGAGCC 
TCATTACAAA ATCGAAAAAT GAATGGTTGG GTACAAGTTT CCAAAACATG GTAAAGTGGA 
CTTTGCGTAT GAGACGTAAA TAGAAAAAAA CACTTGTTAT ATGTTTTCTA GAATTATTGT 
TGTCTCTTTA TGGTTGGATG ATGCAAAATA GTAATTTCGG TTAGTTGCTG TAAAACACCA 
CGAGACAAAT AGATATGGAT ATTTATTAAA TCAGGAAAAA CGTAACTCTC GGCTACTGGA 
TGGTTCAGTC ACCCAACGAT TACTGGGGAG AGAAAACAGG GCAAAAGCAA AGCTTAAAGG 
AATCCGATTG TCATTCGGCA ATGTGCAGCG AAACTAAAAA CCGGATAATG GACCTGTTAA 
TCGAAACATT GAAGATATAT AAAGGAAGAG GAATCCTGGC ATATCATCAA TTGAATAAGT 
TGAATTAATT ATTTCAATCT CATTCTCACT TTCTGACTTA TAGTCGCTTT GTTAAATCAT 
AGGAATGTCT CCCTTGCCAG TACTGCTAGG GTTTTTCTTT CAAACTATGG AAGCCCATTC 
AAGCTGCATA TTACGATTTT GTTTTTCGCT TTTAGAAAGT GGTTTAGATG AGATAATAGA 
AAAATTCTTG ATCTCCGACA ACGAGTACTT TTATTTTTTT TGCTAATCAC TTTACTCAAT 
ATTAGCTCGA AATCGTAGAA ACGTAGACGG GTGCGGGATA CCGAGTGGTG TAGTTAAGAA 
TTTTTATAAA CCACGTGGCC CAAAAATATG AACCCAAAAC GTTTATACAT GAG TATA CTT 
TAAGAAGGCT ATACCCCTTC GTGTTAGATG TAGTTTTAGC TACCCAACCC GAGTCTATGA 
GCTTGACTTC AGATGTAGAA GGCATTAAAT CGTTTTGAAT ATTAATTAAA AAACGATGAA 
AATTAAATAT TTAAAAGCAA TCATACGCTG AAAATTTAGT GCTGTGGCTA ATCCTTCAAC 



ATGGAAATGC CATAAAAGTG ACTTTGACAA AAAAAAAAGT ATATACAGGT AGTAAACTCA 
TCTACTTCAT TGACTTTGTT TACAGCATGT GGAAGGAGGA ATATTTATTG CTAAATCGTA 
GTTTAACATT CAATAAGTAA TACTATTGAA ATTCGACAAG ATTGGCCGCA TGGATGAAAA 
AGAGGCATTT TGCTTTGGGA GAATTAGTTC AAATTAGAAC TGAAAAAAAA AACTTTACGA 
GGCAAAAATG TCGGATTGAG ATCGTAAAAG TTCGCTCGTC GTCTTTTGCT TTGTGATTGT 
TTTCATGGAT A CATCTTG CT GGATATTTAA ATTTTAGTAC TATGTATAAG ATATTCTATA 
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AATGTTTTAT CACCCAAACC TGTTAGCGCC TTCTTAATTC TATTCAATCT GGCTTTTGCT 2100 

CTGAGACTAC TTCTTGGACT TTCACTACTT CTTAGTTATA CGGAATTTGT GTAATTAGAA 2160 

GTGAAATAAT CCTTTCTATT AGTAATGCAA ACAAAAATC 2199 
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CTCGAGCTGA AGAACCAGCG AGGCGGCGAG GCAGCCCCCG CGGCTTGCAG CGGAGGCGAC 
AGCTCGTCTC CTGCCGTGGA GGTGTCGCCG GTGGTGGGGG GGAGAGACTT GCTCCAAAAA 
AACGGACGTC TCCAGCTCTC CCCCCTCCCT CTTTTCCGTT AGGAATCCGG CGAGGAAATA 
CATGCACTCG CTGAGAATCG GCGGCGCCAG GAGGCAGCGC CACAAGGTGT AGCGAGTGAG 
TGGGGTGGGG CAAGAGGGGA CCCAGGAGTC CCCCAGGCTC CCGGCGCGCC TGCTCCTGCT 
CTTCAATCCT GCCCACGGGG CGGACGGAGT GACCCCCGCC CCGACCATGG TAGTGTTCAA 
TGGCCTTCTT AAGATCAAAA TCTGCGAGGC GGTGAGCTTG AAGCCCACAG CCTGGTCGCT 
GCGCCATGCG GTGGGACCCC GG CCA CAG AC GTTCCTTTTG GACCCCTACA TTGCCCTTAA 
CGTGGACGAC TCGCGCATCG GCCAAACAGC CACCAAGCAA AAGACCAACA GCCCGGCCTG 
GCACGATGAG TTCGTCACCG ATGTGTGCAA TGGGCGCAAG ATCGAGCTGG CTGTCTTTCA 
CGACGCTCCT ATCGGCTACG ACGACTTCGT GGCCAACTGC ACCATCCAGT TCGAGGAGCT 
GCTGCAGAAT GGGAGCCGTC ACTTCGAGGA CTGGATTGAC CTGGAGCCAG AAGGAAAAGT 
GTACGTGATC ATCGATCTCT CGGGATCATC GGGTGAAGCC CCTAAAGACA ATGAAGAACG 
AGTGTTCAGG GAGCGTATGC GGCCAAGGAA GCGGCAAGGG GCTGTCAGGC GCAGGGTCCA 
CCAGGTCAAT GGCCACAAGT TCATGGCCAC CTACTTGCGG CAACCCACCT ACTGCTCCCA 
CTGCAGAGAT TTCATCTGGG GTGTCATAGG AAAACAGGGA TATCAATGTC AAGTTTGCAC 
TTGCGTTGTC CACAAGCGAT GTCATGAGCT CATTATTACA AAGTGCGCTG GGCTGAAGAA 
ACAGGAAACC CCTGACGAGG TGGGCTCCCA ACGGTTCAGC GTCAACATGC CCCACAAGTT 
CGGGATCCAC AACTACAAGG TCCCCACGTT CTGTGACCAC TGTGGGTCCC TGCTCTGGGG 
CCTCTTGCGG CAGGGCTTGC AGTGTAAAGT CTGCAAAATG AATGTTCACC GGCGATGTGA 
GACCAACGTG GCTCCCAACT GTGGGGTAGA CGCCAGAGGA ATTGCCAAAG TGCTGGCTGA 
CCTCGGTGTT ACTCCAGACA AAATCACCAA CAGTGGCCAA AGGAGGAAAA AGCTCGCTGC 
TGGTGCTGAG TCCCCACAGC CGGCTTCTGG AAACTCCCCA TCTGAAGACG ACCGATCCAA 
GTCAGCGCCC ACCTCCCCTT GTGACCAGGA ACTAAAAGAA CTTGAAAACA A CATC CGG AA 
GGCCTTGTCA TTTGACAACC GAGGAGAGGA GCACCGAGCG TCGTCGGCCA CCGATGGCCA 
GCTGGCAAGC CCCGGAGAGA ATGGGGAAGT CCGGCCAGGC CAGGCCAAGC GCTTGGGGCT 
GGATGAGTTC AACTTCATCA AAGTGTTGGG CAAAGGCAGC TTTGGCAAGG TCATGTTGGC 
GGAACTCAAA GGCAAAGATG AAGTCTACGC TGTGAAGGTC TTGAAGAAGG ACGTTATCCT 
ACAAGACGAT GATGTGGACT GCACAATGAC AGAGAAGAGG ATTTTGGCTC TGGCTCGGAA 
ACACCCTTAT CTAACCCAAC TCTATTGCTG CTTCCAGACC AAGGACCGCC TCTTCTTCGT 
CATGGAATAT GTAAATGGTG GAGACCTCAT GTTCCAGATT CAGCGGTCCC GAAAATTTGA 
TGAGCCTCGT TCTCGGTTCT ATGCCGCAGA GGTCACATCG GCCCTCATGT TTCTCCACCA 
GCATGGAGTG ATCTACAGGG ATTTGAAACT GGACAACATC CTTCTAGATG CAGAAGGCCA 
CTGCAAGCTG GCTGACTTTG GGATGTGCAA GGAAGGGATT ATGAATGGTG TGACAACTAC 
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CACCTTCTGT GGGACTCCTO ACTACATAGC TCCAGAGATC CTACAGGAGT TGGAGTACGG 
CCCCTCAGTG GACTGGTGGG CCCTGGGGGT GCTGATGTAC GAGATGATGG CTGGGCAGCC 
CCCCTTTGAA GCTGACAACG AGGACGACTT GTTCGAATCC ATCCTTCATG ATGATGTTCT 
CTATCCTGTC TGGCTTAGCA AGGAAGCTGT GAGCATCCTG AAAGCTTTCA TGACCAAGAA 
CCCGCACAAG CCCCTGGGCT GTGTGGCAGC GGAGAACGGG GAGGAOGCCA TCAAGCAACA 
TCCAXTCTTC AAGGAGATTG ACTGGGTACT GCTGGAGCAG AAGAAAATCA AGCCCCCCTT 
CAAGCCGAGA ATTAAAACCA AAAGAGATGT CAATAACTTT GACCAAGACT TTACGCGGGA 
AGAGCCAATA CTTACACTTG TGGATGAAGC AATCATTAAG CAGATCAACC AGGAAGAATT 
CAAAGGCTTC TCCTACTTTG GTGAAGACCT GATGCCCTGA GAGGCTGCTT CGGATGGAGG 
GAGCTCATGC TCCAAGGACG GTGTTGAGAT ACTCCCAAGC TGCAGAGGCT CCCAAGGTCT 
CAACTCCTCC TCCTCCTCCC CCTCCCCAGA GCCCCAGTCC CATGTCCACT CTCTTATTTA 
TTGCATT 
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GGCCCCTGTT 


CTGCAGAAAG 


GGGGCTCTGA 


GGCAGAAGGT 


GGTCCATGAG 


GTCAAGAGCC 


60 


ACAAGTTCAC 


CGCTCGCTTC 


TTCAAGCAGC 


CGACCTTCTG 


CAGCCACTGC 


ACTGACTTCA 


120 


TATGGGGGAT 


TGGAAAACAG 


GGTCTGCAAT 


GTCAAGTCTG 


CAGTTTTGTG 


GTTCATCGAC 


180 


GATGCCACGA 


GTTTGTGACC 


TTCGAGTGTC 


CAGGCGCTGG 


GAAGGGCCCC 


CAGACGGACG 


240 
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TCATGTAATC CCACCTGCCG CCACCAGGCG TCCCCACGGC TCCCTCCTCC GCCCCGGCTT 2100 

TGGCCCTCGC CTCACCATGC CACCCGCCTT TCCAATTCTA GATATGGCTC CCCAGCGTTC 2160 

TGGCCTC 2167 
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GGCGGCGGCC GCGGGGATCC CGCGAGCGGC CCCTGAACAT CTACCCTTCT TGCCGGGACC 
CGGGAGGTCC CCACTGGCCT CCGGGCCCGT CCTGATCAGA CTCGTGTCCA CCTCCCCGTC 
CACGCGCATC CGGGAGAGCC GCGCCACGAG ACGGACCCGG GCCCGCOGGG ACCCCTGGTG 
TCTGGCCCTG CGTCGAGAGG CTGGTGACTG CCACCCATAA GCTCCAGCTT CAGCCTCGGC 
TTACTCCCCT CAGGGGCTTG CAGGCTGAGG CCTGCCCTCG GACGCGGCTG ACCAGCCTCT 
CCCTCTCTTC CACACTTTGG ACTTCTCTTT GGACCTCCTA AAAAGGCTCC ATCATGGCAC 
CGTTCCTGCG CATCTCCTTC AATTCCTATG AGCTGGGCTC CCTGCAGGCG GAGGACGACG 
CAAGCCAGCC TTTCTGTGCC GTGAAGATGA AGGAGGCACT CACCACAGAC CGAGGGAAGA 
CTCTGGTACA GAAGAAGCCC ACAATGTACC CTGAGTGGAA GTCAACATTC GACGCCCACA 
TCTATGAAGG CCGTGTCATC CAGATCGTGC TGATGCGGGC AGCTGAAGAC CCCATGTCGG 
AGGTGACCGT GGGCGTGTCA GTGCTGGCTG AGCGCTGCAA GAAGAACAAC GGCAAGGCTG 
AGTTCTGGCT GGACCTGCAG CCTCAGGCCA AGGTGCTGAT GTGTGTGCAG TATTTCCTGG 
AGGATGGGGA TTGCAAACAG TCCATGCGTA GTGAGGAGGA GGCCATGTTC CCAACTATGA 
ACCGCCGTGG AGCCATTAAA CAGGCCAAGA TTCACTACAT CAAGAACCAC GAGTTCATCG 
CCACCTTCTT TGGGCAGCCC ACCTTCTGTT CTGTGTGCAA AGAGTTTGTC TGGGGCCTCA 
ACAAGCAAGG CTACAAATGC AGGCAATGCA ACGCTGCCAT CCATAAGAAA TGCATCGACA 
AGATTATCGG CCGCTGCACT GGGACTGCTA CCAATAGCCG GGACACCATC TTCCAGAAAG 
AACGCTTCAA CATCGACATG CCTCACCGAT TCAAGGTCTA TAACTACATG AGCCCCACCT 
TCTGTGACCA CTGTGGCACT TTGCTCTGGG GATTGGTGAA ACAGGGATTA AAGTGTGAAG 
ACTGCGGCAT GAATGTGCAC CACAAATGCC GGG AGAAGGT GGCCAACCTG TGTGGTATCA 
ACCAAAAGCT CTTGGCTGAG GCCTTGAACC AAGTGACCCA GAAAGCTTCC CGGAAGCCAG 
AGACACCAGA GACTGTCGGA ATATACCAGG GATTCGAGAA GAAGACAGCT GTCTCTGGGA 
ATGACATCCC AGACAACAAC GGGACCTATG GCAAGATCTG GGAGGGGAGC AACCGGTGCC 
GCCTTGAGAA CTTCACCTTC CAGAAAGTAC TTGGCAAAGG CAGCTTTGGC AAGGTACTGC 
TTGCAGAACT GAAGGGCAAG GAAAGGTACT TTGCAATCAA GTACCTGAAG AAGGACGTGG 
TGTTGATCGA CGATGACGTG GAGTGCACCA TGGTGGAGAA GCGGGTGCTG GCGCTCGCCT 
GGGAGAATCC CTTCCTCACC CATCTCATCT GTACCTTCCA GACCAAGGAC CACCTCTTCT 
TTGTGATGGA GTTCCTCAAT GGGGGCGATC TGATGTTCCA CATTCAGGAC AAAGGCCGCT 



TCGAACTCTA CCGGGCTACG TTTTATGCAG CTGAGATCAT CTGCGGACTG CAGTTTCTAC 
ATGGCAAAGG CATCATTTAC AGGGACCTCA AGCTAGACAA TGTAATGCTG GACAAGGATG 
GCCACATCAA GATTGCTGAC TTCGGGATGT GCAAAGAGAA TATATTTGGG GAGAACCGGG 
CCAGCACATT CTGCGGCACT CCTGACTACA TCGCCCCTGA GATCCTGCAG GGCCTGAAGT 
ACTCATTTTC CGTGGACTGG TGGTCTTTTG GGGTCCTCCT CTATGAGATG CTCATTGGCC 
AGTCCCCCTT CCATGGTGAT GATGAGGACG AGCTCTTTGA GTCCATCCGG GTGGACACAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1B00 
1860 
1920 
1980 
2040 



Figure 5; page 1 of 2 



SUBSTITUTE SHEET 



WO 94/03609 



PCT/GB93/01651 




CACACTACCC 


GCGCTGGATC 


ACCAAGGAGT 


GGGACCCTGC 


CAAGAGGCTG 


GGAGTAACAG 


CTATCAACTG 


GAACCTGCTG 


GAGAAGCGGA 


AATCCCCTTC 


AGACTACAGC 


AACTTTGACC 


CCTTCAGTGA 


CAAGAACCTC 


ATCGACTCTA 


TTGTGAACCC 


CAAATATGAG 


GAATTCCTGG 


CCCCGG GAGA 


Min% ^^^^^^^ m 

G T AGG CC CAT 


CTGCCCTGGT 


Ula Xaj AC rQGT 


GATTCCTGCT 


GCTGCCCCCT 


vV»w CTCACAG 


TACTTCCTCT 


GTGAACTGTT 


utfAAAU 1 bin 


AATCUTGTGT 


GTCATTACTT 




TATATAATAG 


GCTGTATATA 


GCTGATGTGT 


TGACCTTTTT 


TTaAAAAAAA 


ACACACGTAT 


ACATATATGT 


ATGTATGTAT 


AAAGAGAGCA 


CAAGCTACCT 


GAACCACAGG 


ATGGTAAAAA 


A 





11/17 




CTCTTCGAGA 


2100 


TTTTTCAAGA 


2160 


CCCAAAGTGA 


2220 


CCCCAACTTT 


2280 


GG CTTCTCCT 


2340 


GCXxX^AATG 


2400 


AAGAAGAGTG 


2460 


CCTGTTGG CT 


2520 


G CCAT CGG AG 


2580 


ATATTATATA 


^b4U 


v X Atf l» ASvAWT 




GTATACATCT 


2760 


GACCAAAAGA 


2820 


TAAACACTGA 


2880 




2891 



Figure 5; page 2 off 2 



SUBSTITUTE SHEET. 



WO 94/03609 PCT/GB93/01651 




12/17 



TCCGGGTTCC 


CCAGTGCCAG 


CCAGCGCGGC 


CCCCTCGGGG 


CTCCGGCAGC 


AGCGCCGGCA 


60 


TGTCGTCCGG 


CACGATGAAG 


TTCAATGGCT 


ATCTGAGGGT 


CCGCATCGGA 


GAGGCTGTAG 


120 


GGCTGCAGCC 


CACCCGCTGG 


TCCCTGCGGC 


ACTCGCTCTT 


CAAAAAGGGC 


CACCAGCTGC 


180 


TGGACCCCTA 


CCTGACGGTG 


AGCGTAGACC 


AGGTACGCGT 


GGGCCAGACC 


AGCACAAAGC 


240 


AGAAGACCAA 


CAAACCCACC 


TACAACGAGG 


AGTTCTGCGC 


CAATGTCACC 


GACGGCGGCC 
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ACCTGGAGCT 


AGCCGTCTTC 


cacgagacgc 


CCCTGGGTTA 


TGACCACTTT 


GTGGCCAACT 


360 


GCACGCTGCA 


GTTCCAGGAG 


CTGTTGCGCA 


CCGCTGGTAC 


CTCGGACACC 


TTCGAGGGCT 


420 


GGGTGGATCT 


GGAGCCTGAG 


GGGAAAGTGT 


TTGTGGTAAT 


AACCCTAACA 


GGGAGTTTCA 


480 


CTGAAGCCAC 


TCTCCAGAGA 


GACCGCATCT 


TCAAGCATTT 


TACCAGGAAG 


CGCCAAAGGG 
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CTATGCGAAG 


ACGAGTCCAT 


CAAGTGAACG 
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TACCTGAGGC 
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GACATCTTCC TATCATTAAC CAGGATGAGT TTAGAAACTT TTCCTATGTG TCACCGGAAT 2100 

TGCAACTGTA GCCTTATGGG GAGTCAGAAC CAAAGGGGAA GGTGGATTTC TCCAGGAATT 2160 

TCTTATGTGG GAATTC 2176 
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